ON THE RING OF APPROXIMATION TRIPLES 
ATTACHED TO A CLASS OF EXTREMAL REAL NUMBERS 



DAMIEN ROY AND ERIC VILLANI 

Abstract. We attach a ring of sequences to each number from a certain class of extremal 
real numbers, and we study the properties of this ring both from an analytic point of view 
by exhibiting elements with specific behaviors, and also from an algebraic point of view by 
identifying it with the quotient of a polynomial ring over Q. The link between these points of 
view relies on combinatorial results of independent interest. We apply this theory to estimate 
the dimension of a certain space of sequences satisfying prescribed growth constraints. 



1. Introduction 

Let $\gamma = (1 + \sqrt{5})/2$ denote the golden ratio. In [3], H. Davenport and W. M. Schmidt
proved that, for each real number $\xi$ which is neither rational nor quadratic irrational, there
exists a constant $c > 0$ with the property that, for arbitrarily large real numbers $X$, the
system of inequalities

(1)    $|x_0| \le X, \quad |x_0\xi - x_1| \le cX^{-1/\gamma}, \quad |x_0\xi^2 - x_2| \le cX^{-1/\gamma}$

has no non-zero solution $\mathbf{x} = (x_0, x_1, x_2) \in \mathbb{Z}^3$. Because of this, we say that a real number
$\xi$ is extremal if it is neither rational nor quadratic irrational and if there exists a constant
$c' > 0$ such that the system (1) with $c$ replaced by $c'$ has a non-zero solution $(x_0, x_1, x_2) \in \mathbb{Z}^3$
for each $X \ge 1$. The existence of such numbers is established in [6, 7], showing in
particular that the exponent $1/\gamma$ in the result of Davenport and Schmidt is best possible.
Among these numbers are all real numbers whose continued fraction expansion is the infinite
Fibonacci word constructed on an alphabet consisting of two different positive integers [6]
(or a generalized such word constructed on two non-commuting words in positive integers
[10]). This connection with symbolic dynamics is extended by M. Laurent and Y. Bugeaud
in [2], and stressed even further in recent work of S. Fischler [4, 5].

Schmidt's subspace theorem implies that any extremal real number is transcendental (see
[11, Chap. VI, Theorem 1B]). Using a quantitative version of the subspace theorem,
B. Adamczewski and Y. Bugeaud even produced a measure of transcendence for these numbers,
showing that, in terms of Mahler's classification, they are either S or T numbers
[1, Theorem 4.6]. The purpose of the present paper is to provide tools which may eventually lead to
sharper measures of approximation to extremal real numbers either by all algebraic numbers
or by more restricted types of algebraic numbers (as in [8]).

As shown in [7], any extremal real number comes with rigid sequences of integer triples
$(x_0, x_1, x_2)$ satisfying a stronger approximation property than that required by (1). In the
next section, we show that, for each extremal real number in some large family, this naturally
gives rise to a finitely generated ring of sequences over $\mathbb{Q}$. We study this ring in Sections 2
and 3 both from an analytic point of view by exhibiting elements with specific behaviors,
and also from an algebraic point of view by showing that it is isomorphic to the quotient of
a polynomial ring in six variables over $\mathbb{Q}$ by an ideal $I$ with three explicitly given generators.
The link between these points of view relies on two combinatorial results of independent
interest that are stated at the beginning of Sections 2 and 3 and proved in Section 4. In
Section 5 we apply this theory to estimate the dimension of a certain space of sequences
with restricted growth. Following a suggestion of Daniel Daigle, we conclude in Section 6
with a complementary result showing that the ideal $I$ mentioned above is a prime ideal of
rank 3 and thus, that the ring of sequences attached to the extremal real numbers under
study is an integral domain of transcendence degree 3.

2. The ring of approximation triples 

2.1. A combinatorial result. We denote by $\mathbb{N}$ the set of non-negative integers, and by
$\mathbb{N}^* = \mathbb{N} \setminus \{0\}$ the set of positive integers. We also denote by $f\colon \mathbb{Z} \to \mathbb{Z}$ the function
satisfying

(2)    $f(0) = f(1) = 1$ and $f(i+2) = f(i+1) + f(i)$ for each $i \in \mathbb{Z}$,

so that $(f(i))_{i \ge 0}$ is simply the Fibonacci sequence. In order not to interrupt the flow of the
discussion later, we start by stating the following crucial combinatorial result whose proof is
postponed to Section 4 and whose relevance will be made clear shortly.

Theorem 2.1. Let $d \in \mathbb{N}$. For each $s \in \mathbb{N}$, denote by $\chi_d(s)$ the number of points $(m, n) \in \mathbb{Z}^2$
for which the conditions

(3)    $(m, n) = \sum_{k=1}^{s} \bigl(f(-i_k),\, f(-i_k - 1)\bigr)$ and $\sum_{k=1}^{s} f(i_k) \le d$

admit a solution in integers $0 \le i_1 \le \cdots \le i_s$, and $s$ is maximal with this property. Then,

(4)    $\chi_d(s) = \begin{cases} 2s + 1 & \text{if } 0 \le s < d, \\ d + 1 & \text{if } s = d, \\ 0 & \text{if } s > d. \end{cases}$

Here, and throughout the rest of this paper, we agree that an empty sum is zero, so that
for $(m, n) = (0, 0)$, the conditions (3) are satisfied with $s = 0$.
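
For small values of d, the count in Theorem 2.1 can be checked by exhaustive enumeration. The following Python sketch is only an illustration and not part of the paper's argument: the function names `fib`, `chi_bruteforce` and `chi_formula` are hypothetical, and the search bounds rely on the elementary facts that f(i) ≥ 1 for i ≥ 0 and f(i) ≥ i for i ≥ 1.

```python
from collections import Counter
from itertools import combinations_with_replacement


def fib(i):
    """f(0) = f(1) = 1, extended to all integers by f(i+2) = f(i+1) + f(i)."""
    a, b = 1, 1  # (f(0), f(1))
    if i >= 0:
        for _ in range(i):
            a, b = b, a + b
        return a
    for _ in range(-i):
        a, b = b - a, a  # step back: f(j-1) = f(j+1) - f(j)
    return a


def chi_bruteforce(d):
    """Count the points (m, n) of condition (3) according to their maximal size s."""
    # Since f(i) >= i for i >= 1, only indices i <= d can occur.
    indices = [i for i in range(d + 1) if fib(i) <= d]
    best = {(0, 0): 0}  # the empty sum represents (0, 0) with s = 0
    for s in range(1, d + 1):  # f(i) >= 1 forces s <= d
        for seq in combinations_with_replacement(indices, s):
            if sum(fib(i) for i in seq) <= d:
                point = (sum(fib(-i) for i in seq),
                         sum(fib(-i - 1) for i in seq))
                best[point] = max(best.get(point, 0), s)
    return Counter(best.values())


def chi_formula(d, s):
    """Right-hand side of (4)."""
    if s < d:
        return 2 * s + 1
    return d + 1 if s == d else 0
```

For instance, `chi_bruteforce(3)` returns the counts 1, 3, 5, 4 for s = 0, 1, 2, 3, in agreement with (4).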
For our purposes, we need to recast this result in the following context. Consider the
subring $\mathbb{Z}[\gamma]$ of $\mathbb{R}$ generated by $\gamma$. Since $\gamma = 1 + 1/\gamma$ and $(1/\gamma)^2 = 1 - (1/\gamma)$, we note that
$\mathbb{Z}[\gamma] = \mathbb{Z}[1/\gamma] = \mathbb{Z} \oplus \mathbb{Z} \cdot (1/\gamma)$ is a free $\mathbb{Z}$-module with basis $\{1, 1/\gamma\}$. The formulas

(5)    $\gamma^{-i} = f(-i) + f(-i-1)/\gamma = (-1)^i \bigl(f(i-2) - f(i-1)/\gamma\bigr)$,

which follow from a quick recurrence argument, show that, for any point $\alpha = m + n/\gamma \in \mathbb{Z}[\gamma]$,
the conditions (3) are equivalent to

(6)    $\alpha = \gamma^{-i_1} + \cdots + \gamma^{-i_s}$ and $f(i_1) + \cdots + f(i_s) \le d$.

For each $d \in \mathbb{N}$, let $E_d$ denote the set of points $\alpha \in \mathbb{Z}[\gamma]$ for which these conditions admit
a solution in integers $0 \le i_1 \le \cdots \le i_s$ for some $s \in \mathbb{N}$ and, for each $\alpha \in E_d$, let $s_d(\alpha)$
denote the largest value of $s$ for which such a solution exists. Then, Theorem 2.1 can be
restated by saying that, for each pair of integers $d, s \in \mathbb{N}$, the number of elements $\alpha$ of $E_d$
with $s_d(\alpha) = s$ is the integer $\chi_d(s)$ given by (4). It is in this form that Theorem 2.1 will be
proved in Section 4 together with a more compact description of the sets $E_d$.
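
The formulas (5) can be confirmed with exact integer arithmetic in the basis {1, 1/γ}. The sketch below is illustrative only: the pair (m, n) stands for m + n/γ, products are reduced with (1/γ)² = 1 − 1/γ, and the helper names `multiply`, `inverse_gamma_power` and `formula_5` are hypothetical.

```python
def fib(i):
    """f(0) = f(1) = 1, extended to all integers by f(i+2) = f(i+1) + f(i)."""
    a, b = 1, 1  # (f(0), f(1))
    if i >= 0:
        for _ in range(i):
            a, b = b, a + b
        return a
    for _ in range(-i):
        a, b = b - a, a  # step back: f(j-1) = f(j+1) - f(j)
    return a


def multiply(x, y):
    """Product of m1 + n1/gamma and m2 + n2/gamma, using (1/gamma)^2 = 1 - 1/gamma."""
    (m1, n1), (m2, n2) = x, y
    return (m1 * m2 + n1 * n2, m1 * n2 + n1 * m2 - n1 * n2)


def inverse_gamma_power(i):
    """Coordinates (m, n) of gamma^(-i) = m + n/gamma, for i >= 0."""
    point = (1, 0)
    for _ in range(i):
        point = multiply(point, (0, 1))
    return point


def formula_5(i):
    """Second expression in (5): (-1)^i (f(i-2) - f(i-1)/gamma)."""
    sign = -1 if i % 2 else 1
    return (sign * fib(i - 2), -sign * fib(i - 1))
```

With these definitions, `inverse_gamma_power(3)` gives `(-1, 2)`, that is γ⁻³ = −1 + 2/γ, matching both expressions in (5).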

2.2. A class of extremal numbers. We first recall that a real number $\xi$ is extremal if and
only if there exists an unbounded sequence of points $\mathbf{x}_k = (x_{k,0}, x_{k,1}, x_{k,2})$ in $\mathbb{Z}^3$, indexed by
integers $k \ge 1$, and a constant $c_1 \ge 1$ such that, for each $k \ge 1$, the first coordinate
$x_{k,0}$ of $\mathbf{x}_k$ is non-zero and we have

E1: $c_1^{-1} |x_{k,0}|^{\gamma} \le |x_{k+1,0}| \le c_1 |x_{k,0}|^{\gamma}$,

E2: $\max\{|x_{k,0}\xi - x_{k,1}|,\ |x_{k,0}\xi^2 - x_{k,2}|\} \le c_1 |x_{k,0}|^{-1}$,

E3: $1 \le |x_{k,0}x_{k,2} - x_{k,1}^2| \le c_1$,

E4: $1 \le |\det(\mathbf{x}_k, \mathbf{x}_{k+1}, \mathbf{x}_{k+2})| \le c_1$.

This follows from Theorem 5.1 of [7] upon noting that the condition E2 forces the maximum
norm of $\mathbf{x}_k$ to behave like $|x_{k,0}|$. As in [7], it is convenient to identify each triple
$\mathbf{x} = (x_0, x_1, x_2)$ with coefficients in a commutative ring with the symmetric matrix

$\mathbf{x} = \begin{pmatrix} x_0 & x_1 \\ x_1 & x_2 \end{pmatrix}.$

Then, the condition E3 reads simply as $1 \le |\det(\mathbf{x}_k)| \le c_1$. We also recall that, for a
given extremal real number $\xi$, the corresponding sequence $(\mathbf{x}_k)_{k \ge 1}$ is unique up to its first
terms and up to multiplication of its terms by non-zero rational numbers with bounded
numerators and denominators (see Proposition 4.1 of [9]). Moreover, for such $\xi$ and such a
sequence $(\mathbf{x}_k)_{k \ge 1}$ viewed as symmetric matrices, Corollary 4.3 of [9] ensures the existence of
a non-symmetric and non-skew-symmetric $2 \times 2$ matrix

(7)    $M = \begin{pmatrix} a_{1,1} & a_{1,2} \\ a_{2,1} & a_{2,2} \end{pmatrix}$

with integer coefficients such that, for each sufficiently large $k \ge 1$, the matrix $\mathbf{x}_{k+2}$ is a
rational multiple of $\mathbf{x}_{k+1} M_{k+1} \mathbf{x}_k$ where

(8)    $M_{k+1} = \begin{cases} M & \text{if } k \text{ is odd}, \\ {}^tM & \text{if } k \text{ is even}, \end{cases}$

and where ${}^tM$ denotes the transpose of $M$. The present paper deals with a special class of
extremal numbers.

Hypothesis 2.2. In the sequel, we fix an extremal real number $\xi$, a corresponding sequence
$(\mathbf{x}_k)_{k \ge 1}$ and a corresponding matrix $M$ satisfying the additional property that, for each $k \ge 1$,
we have

(9)    $\det(\mathbf{x}_k) = 1$ and $\mathbf{x}_{k+2} = \mathbf{x}_{k+1} M_{k+1} \mathbf{x}_k$.

The first condition $\det(\mathbf{x}_k) = 1$ is restrictive as there exist extremal real numbers with
no corresponding sequence $(\mathbf{x}_k)_{k \ge 1}$ in $\mathrm{SL}_2(\mathbb{Z})$, but it is not empty as it is fulfilled by any
real number whose continued fraction expansion is given by an infinite Fibonacci word
constructed on two non-commuting words in positive integers, provided that both words have
even length (see [10]). The second condition however is no real additional restriction. It is
achieved by omitting the first terms of the sequence $(\mathbf{x}_k)_{k \ge 1}$ if necessary, by choosing $M$ so
that $\mathbf{x}_3 = \mathbf{x}_2 M \mathbf{x}_1$ and then by multiplying recursively each $\mathbf{x}_k$ with $k \ge 4$ by $\pm 1$ so that
the second equality in (9) holds for each $k \ge 1$. In particular, we have $\det(M) = 1$. Since
${}^tM \ne \pm M$, this in turn implies that we have $a_{1,1} \ne 0$ or $a_{2,2} \ne 0$.

2.3. A subring of the ring of sequences. Let $S$ denote the ring of sequences of real
numbers $(a_k)_{k \ge 1}$ indexed by the set $\mathbb{N}^*$ of positive integers, with component-wise addition
and multiplication, and let $\mathfrak{S}$ denote the quotient of $S$ by the ideal $S_0$ of sequences with
finitely many non-zero terms. Two sequences $(a_k)_{k \ge 1}$ and $(b_k)_{k \ge 1}$ in $S$ thus represent the
same element of $\mathfrak{S}$ if and only if $a_k = b_k$ for each sufficiently large integer $k$. We view $\mathbb{R}$ as
a subring of $\mathfrak{S}$ by identifying each $x \in \mathbb{R}$ with the image of the constant sequence $(x)_{k \ge 1}$
modulo $S_0$. This gives $\mathfrak{S}$ the structure of an $\mathbb{R}$-algebra. By restriction of scalars, we may
also view $\mathfrak{S}$ as a $\mathbb{Q}$-algebra. In particular, given elements $\mathfrak{A}_1, \dots, \mathfrak{A}_\ell$ of $\mathfrak{S}$ we can form the
sub-$\mathbb{Q}$-algebra $\mathbb{Q}[\mathfrak{A}_1, \dots, \mathfrak{A}_\ell]$ that they generate.

For each $i \in \mathbb{Z}$ and each $j = 0, 1, 2$, we define an element $\mathfrak{X}_j^{(i)}$ of $\mathfrak{S}$ by

(10)    $\mathfrak{X}_j^{(i)}$ = class of $(x_{2k+i,j})_{k \ge 1}$ in $\mathfrak{S}$,

where for definiteness we agree that $x_{2k+i,j} = 0$ when $2k + i \le 0$, although the resulting
element of $\mathfrak{S}$ is independent of this choice. Clearly, $\mathfrak{X}_j^{(i+2)}$ differs simply from $\mathfrak{X}_j^{(i)}$ by a shift,
but nevertheless they are quite different from the algebraic point of view. For each $i \in \mathbb{Z}$,
we also define a triple

(11)    $\mathfrak{X}^{(i)} = (\mathfrak{X}_0^{(i)}, \mathfrak{X}_1^{(i)}, \mathfrak{X}_2^{(i)}) \in \mathfrak{S}^3$.

Identifying these triples with $2 \times 2$ symmetric matrices according to our general convention,
and using (8) to extend the definition of $M_{k+1}$ to all integers $k$, Hypothesis 2.2 gives

(12)    $\det(\mathfrak{X}^{(i)}) = 1$ and $\mathfrak{X}^{(i+2)} = \mathfrak{X}^{(i+1)} M_{i+1} \mathfrak{X}^{(i)}$

for each $i \in \mathbb{Z}$. Our goal in this paper is to study the sub-$\mathbb{Q}$-algebra $\mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]$ of $\mathfrak{S}$. For
this purpose, we form the polynomial ring $\mathbb{Q}[\mathbf{X}, \mathbf{X}^*]$ in six indeterminates

$\mathbf{X} = (X_0, X_1, X_2)$ and $\mathbf{X}^* = (X_0^*, X_1^*, X_2^*)$.

We first note that $\mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]$ contains the coordinates of $\mathfrak{X}^{(i)}$ for each $i \in \mathbb{Z}$.

Lemma 2.3. For each $i \in \mathbb{Z}$, the coordinates of $\mathfrak{X}^{(-i)}$ can be written as values at the point
$(\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)})$ of polynomials of $\mathbb{Q}[\mathbf{X}, \mathbf{X}^*]$ that are separately homogeneous of degree $|f(i-2)|$
in $\mathbf{X}$ and homogeneous of degree $|f(i-1)|$ in $\mathbf{X}^*$.

Proof. By (12), the coordinates of $\mathfrak{X}^{(i+2)}$ are bilinear forms in $(\mathfrak{X}^{(i+1)}, \mathfrak{X}^{(i)})$ for each $i \in \mathbb{Z}$.
Since each matrix $\mathfrak{X}^{(i)} M_i$ has determinant 1, its inverse is its adjoint. Thus we find

$\mathfrak{X}^{(-i-2)} = \mathrm{Adj}\bigl(\mathfrak{X}^{(-i-1)} M_{-i-1}\bigr)\, \mathfrak{X}^{(-i)},$

showing that, for each $i \in \mathbb{Z}$, the coordinates of $\mathfrak{X}^{(-i-2)}$ are also bilinear in $(\mathfrak{X}^{(-i-1)}, \mathfrak{X}^{(-i)})$.
By recurrence, this implies that, for each $i \ge 0$, the coordinates of $\mathfrak{X}^{(-i)}$ (resp. $\mathfrak{X}^{(i)}$) are the
values at $(\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)})$ of bi-homogeneous polynomials of bi-degree $(f(i-2), f(i-1))$ (resp.
$(f(i), f(i-1))$). The conclusion follows since, for the same values of $i$, the formulas (5) give
$f(i) = |f(-i-2)|$ and $f(i-1) = |f(-i-1)|$. □
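
The degree bookkeeping in this proof is easy to reproduce mechanically. In the sketch below (illustrative only; the function names are hypothetical), the bi-degree of the coordinates of the j-th triple is propagated in both directions from the bi-degrees (1, 0) and (0, 1) of the triples of index 0 and −1, using only the fact that each new triple is bilinear in the two neighbouring ones, and the result is compared with the bi-degree (|f(−j−2)|, |f(−j−1)|) stated in Lemma 2.3.

```python
def fib(i):
    """f(0) = f(1) = 1, extended to all integers by f(i+2) = f(i+1) + f(i)."""
    a, b = 1, 1  # (f(0), f(1))
    if i >= 0:
        for _ in range(i):
            a, b = b, a + b
        return a
    for _ in range(-i):
        a, b = b - a, a  # step back: f(j-1) = f(j+1) - f(j)
    return a


def bidegrees(bound):
    """Bi-degrees of the triple of index j, for |j| <= bound (bound >= 1)."""
    deg = {0: (1, 0), -1: (0, 1)}
    for j in range(1, bound + 1):
        # the triple of index j is bilinear in those of index j-1 and j-2
        deg[j] = tuple(a + b for a, b in zip(deg[j - 1], deg[j - 2]))
    for j in range(-2, -bound - 1, -1):
        # the triple of index j is bilinear in those of index j+1 and j+2
        deg[j] = tuple(a + b for a, b in zip(deg[j + 1], deg[j + 2]))
    return deg


def lemma_2_3_bidegree(j):
    """Bi-degree predicted by Lemma 2.3 for the triple of index j (i = -j)."""
    return (abs(fib(-j - 2)), abs(fib(-j - 1)))
```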

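The preceding result implicitly uses the natural surjective ring homomorphism

(13)    $\pi\colon \mathbb{Q}[\mathbf{X}, \mathbf{X}^*] \longrightarrow \mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}], \qquad P(\mathbf{X}, \mathbf{X}^*) \longmapsto P(\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)})$.

Our first goal is to describe the kernel of this map.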

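Lemma 2.4. The kernel of $\pi$ contains the ideal $I$ of $\mathbb{Q}[\mathbf{X}, \mathbf{X}^*]$ generated by the polynomials

(14)    $\det(\mathbf{X}) - 1 = X_0X_2 - X_1^2 - 1$,
        $\det(\mathbf{X}^*) - 1 = X_0^*X_2^* - (X_1^*)^2 - 1$,
        $\Phi(\mathbf{X}, \mathbf{X}^*) = a_{1,1} \begin{vmatrix} X_0^* & X_1^* \\ X_0 & X_1 \end{vmatrix} + a_{1,2} \begin{vmatrix} X_1^* & X_2^* \\ X_0 & X_1 \end{vmatrix} + a_{2,1} \begin{vmatrix} X_0^* & X_1^* \\ X_1 & X_2 \end{vmatrix} + a_{2,2} \begin{vmatrix} X_1^* & X_2^* \\ X_1 & X_2 \end{vmatrix}$.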

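Proof. The first equality in (12) tells us that $\det(\mathfrak{X}^{(i)}) = 1$ for each $i \in \mathbb{Z}$. Applying this
with $i = 0$ and $i = -1$, we deduce that $\det(\mathbf{X}) - 1$ and $\det(\mathbf{X}^*) - 1$ belong to the kernel of
$\pi$. On the other hand, the second equality in (12) gives $\mathfrak{X}^{(1)} = \mathfrak{X}^{(0)} M \mathfrak{X}^{(-1)}$ and so

$\begin{pmatrix} \mathfrak{X}_0^{(1)} & \mathfrak{X}_1^{(1)} \\ \mathfrak{X}_1^{(1)} & \mathfrak{X}_2^{(1)} \end{pmatrix} = \begin{pmatrix} \mathfrak{X}_0^{(0)} & \mathfrak{X}_1^{(0)} \\ \mathfrak{X}_1^{(0)} & \mathfrak{X}_2^{(0)} \end{pmatrix} \begin{pmatrix} a_{1,1} & a_{1,2} \\ a_{2,1} & a_{2,2} \end{pmatrix} \begin{pmatrix} \mathfrak{X}_0^{(-1)} & \mathfrak{X}_1^{(-1)} \\ \mathfrak{X}_1^{(-1)} & \mathfrak{X}_2^{(-1)} \end{pmatrix}.$

In particular, the matrix product on the right gives rise to a symmetric matrix. This fact
translates into $\Phi(\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}) = 0$, and so $\Phi \in \ker(\pi)$. □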
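
The last step of this proof rests on an identity that holds for arbitrary entries: with Φ written as the combination of 2×2 determinants given in (14), Φ(X, X*) equals the difference between the two off-diagonal entries of the product X M X*, taken as lower-left minus upper-right. The sketch below spot-checks this at random integer points. It is an illustration, not a proof; the function names are hypothetical and the sign convention is the one just stated.

```python
import random


def det2(a, b, c, d):
    """Determinant of the 2x2 matrix with rows (a, b) and (c, d)."""
    return a * d - b * c


def phi(M, X, Xs):
    """Phi(X, X*) as the combination of 2x2 determinants in (14)."""
    (a11, a12), (a21, a22) = M
    x0, x1, x2 = X
    y0, y1, y2 = Xs
    return (a11 * det2(y0, y1, x0, x1) + a12 * det2(y1, y2, x0, x1)
            + a21 * det2(y0, y1, x1, x2) + a22 * det2(y1, y2, x1, x2))


def symmetric(t):
    """Symmetric matrix attached to a triple (t0, t1, t2)."""
    t0, t1, t2 = t
    return ((t0, t1), (t1, t2))


def matmul(A, B):
    """Product of two 2x2 matrices given as nested tuples."""
    return tuple(tuple(sum(A[i][k] * B[k][j] for k in range(2))
                       for j in range(2)) for i in range(2))


def asymmetry(M, X, Xs):
    """Lower-left minus upper-right entry of the product X M X*."""
    P = matmul(matmul(symmetric(X), M), symmetric(Xs))
    return P[1][0] - P[0][1]


def random_check(trials=200, seed=0):
    """Compare phi and asymmetry at random integer points."""
    rng = random.Random(seed)
    for _ in range(trials):
        M = tuple(tuple(rng.randint(-9, 9) for _ in range(2)) for _ in range(2))
        X = tuple(rng.randint(-9, 9) for _ in range(3))
        Xs = tuple(rng.randint(-9, 9) for _ in range(3))
        if phi(M, X, Xs) != asymmetry(M, X, Xs):
            return False
    return True
```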

We will see below that $I$ is precisely the kernel of $\pi$. In Section 6 we will provide an
alternative proof of this, suggested by Daniel Daigle, showing moreover that $I$ is a prime
ideal of $\mathbb{Q}[\mathbf{X}, \mathbf{X}^*]$ of rank 3, and therefore that $\mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]$ is an integral domain of
transcendence degree 3 over $\mathbb{Q}$.

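2.4. Asymptotic behaviors. The units of $\mathfrak{S}$ are the elements of $\mathfrak{S}$ which are represented
by sequences $(a_k)_{k \ge 1}$ with $a_k \ne 0$ for each sufficiently large $k$. We define an equivalence
relation $\sim$ on the group $\mathfrak{S}^*$ of units of $\mathfrak{S}$ by writing $\mathfrak{A} \sim \mathfrak{B}$ when $\mathfrak{A}$ and $\mathfrak{B}$ are represented
respectively by sequences $(a_k)_{k \ge 1}$ and $(b_k)_{k \ge 1}$ with $\lim_{k \to \infty} a_k/b_k = 1$. Then, for each $i \in \mathbb{Z}$,
the condition E2 implies that

(15)    $\mathfrak{X}_1^{(i)} \sim \xi\, \mathfrak{X}_0^{(i)}$ and $\mathfrak{X}_2^{(i)} \sim \xi^2\, \mathfrak{X}_0^{(i)}$.

For this reason, we regard the points $\mathfrak{X}^{(i)}$ as generic (projective) approximations to the triple
$(1, \xi, \xi^2)$ and, in view of Lemma 2.3, we say that $\mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]$ is the ring of approximation
triples to $(1, \xi, \xi^2)$. Since the second formula in (12) gives

$\mathfrak{X}_0^{(i+2)} = \bigl(\mathfrak{X}_0^{(i+1)} \;\; \mathfrak{X}_1^{(i+1)}\bigr)\, M_{i+1} \begin{pmatrix} \mathfrak{X}_0^{(i)} \\ \mathfrak{X}_1^{(i)} \end{pmatrix},$

we also find that

(16)    $\mathfrak{X}_0^{(i+2)} \sim \theta\, \mathfrak{X}_0^{(i+1)} \mathfrak{X}_0^{(i)}$

where

$\theta = (1 \;\; \xi)\, M_{i+1} \begin{pmatrix} 1 \\ \xi \end{pmatrix} = a_{1,1} + (a_{1,2} + a_{2,1})\xi + a_{2,2}\xi^2$

is independent of $i$ and non-zero because $\xi$ is transcendental over $\mathbb{Q}$ and $M$ is not
skew-symmetric.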

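For each $d \in \mathbb{N}$, we denote by $\mathbb{Q}[\mathbf{X}, \mathbf{X}^*]_{\le d}$ the subspace of $\mathbb{Q}[\mathbf{X}, \mathbf{X}^*]$ consisting of all
polynomials of total degree at most $d$, and by $\mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]_{\le d}$ its image under the evaluation
map $\pi$. The next lemma provides a variety of elements of the latter set, with explicit behavior.

Lemma 2.5. Let $d \in \mathbb{N}$. For each $\alpha = m + n/\gamma \in E_d$ and each integer $j$ with $0 \le j \le 2s_d(\alpha)$,
there exists an element $\mathfrak{M}_{\alpha,j}$ of $\mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]_{\le d}$ with

$\mathfrak{M}_{\alpha,j} \sim \xi^{j}\, \theta^{m+n-s_d(\alpha)}\, (\mathfrak{X}_0^{(0)})^m (\mathfrak{X}_0^{(-1)})^n.$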

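Proof. By recurrence, we deduce from (16) that, for each $i \in \mathbb{Z}$, we have

$\mathfrak{X}_0^{(-i)} \sim \theta^{f(-i)+f(-i-1)-1}\, (\mathfrak{X}_0^{(0)})^{f(-i)} (\mathfrak{X}_0^{(-1)})^{f(-i-1)}.$

Now, let $\alpha = m + n/\gamma \in E_d$, let $s = s_d(\alpha)$, and let $j$ be an integer with $0 \le j \le 2s$.
By definition, there exist integers $0 \le i_1 \le \cdots \le i_s$ satisfying (6). Choose also integers
$j_1, \dots, j_s \in \{0, 1, 2\}$ such that $j_1 + \cdots + j_s = j$. Then, using (15), we find

$\prod_{k=1}^{s} \mathfrak{X}_{j_k}^{(-i_k)} \sim \xi^{j} \prod_{k=1}^{s} \theta^{f(-i_k)+f(-i_k-1)-1}\, (\mathfrak{X}_0^{(0)})^{f(-i_k)} (\mathfrak{X}_0^{(-1)})^{f(-i_k-1)}.$

Since (6) and (3) are equivalent, the product on the right is simply $\xi^{j}\,\theta^{m+n-s} (\mathfrak{X}_0^{(0)})^m (\mathfrak{X}_0^{(-1)})^n$.
The conclusion then follows by observing that, according to Lemma 2.3, the product on the
left is the value at $(\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)})$ of some bi-homogeneous polynomial of $\mathbb{Q}[\mathbf{X}, \mathbf{X}^*]$ with
bi-degree $\bigl(\sum_{k=1}^{s} f(i_k - 2),\ \sum_{k=1}^{s} f(i_k - 1)\bigr)$, and thus with total degree $\sum_{k=1}^{s} f(i_k) \le d$. □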
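
The recurrence behind the first step of this proof can be traced explicitly. Iterating (16) in both directions from the triples of index 0 and −1 yields, for the first coordinate of the triple of index j, an asymptotic expression of the form θ^e times a power a of the first coordinate of index 0 times a power b of the first coordinate of index −1. The sketch below, which is only an illustration with hypothetical names, computes (e, a, b) from the recurrence and compares it with the closed form (f(j) + f(j−1) − 1, f(j), f(j−1)) that this bookkeeping suggests.

```python
def fib(i):
    """f(0) = f(1) = 1, extended to all integers by f(i+2) = f(i+1) + f(i)."""
    a, b = 1, 1  # (f(0), f(1))
    if i >= 0:
        for _ in range(i):
            a, b = b, a + b
        return a
    for _ in range(-i):
        a, b = b - a, a  # step back: f(j-1) = f(j+1) - f(j)
    return a


def exponent_table(bound):
    """Triples (e, a, b) for each index j with |j| <= bound (bound >= 1)."""
    table = {0: (0, 1, 0), -1: (0, 0, 1)}
    for j in range(1, bound + 1):
        # relation (16): index j is theta times index j-1 times index j-2
        (e1, a1, b1), (e2, a2, b2) = table[j - 1], table[j - 2]
        table[j] = (e1 + e2 + 1, a1 + a2, b1 + b2)
    for j in range(-2, -bound - 1, -1):
        # relation (16) solved for the smallest index:
        # index j is index j+2 divided by theta times index j+1
        (e1, a1, b1), (e2, a2, b2) = table[j + 1], table[j + 2]
        table[j] = (e2 - e1 - 1, a2 - a1, b2 - b1)
    return table


def closed_form(j):
    """Closed form (f(j) + f(j-1) - 1, f(j), f(j-1)) for the index j."""
    return (fib(j) + fib(j - 1) - 1, fib(j), fib(j - 1))
```

Negative exponents appear for negative indices, which is harmless here since the relation ∼ is defined on units.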
In fact, we claim that the elements $\mathfrak{M}_{\alpha,j}$ constructed in the preceding lemma form a basis
of $\mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]_{\le d}$. To prove this, we will first show that they are linearly independent over
$\mathbb{Q}$, and count them using Theorem 2.1. This will provide a lower bound for the dimension
of $\mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]_{\le d}$. Next, using the fact that the ideal $I$ is contained in the kernel of
the evaluation map $\pi$, we will find that the same number is also an upper bound for this
dimension. This will prove our claim and will bring other consequences as well. We now
proceed to the first step of this programme.

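2.5. Growth estimates. Let $\mathfrak{S}_+$ denote the subgroup of $\mathfrak{S}^*$ whose elements are represented
by sequences with positive terms. Given $\mathfrak{A}, \mathfrak{B} \in \mathfrak{S}_+$, we write $\mathfrak{A} \ll \mathfrak{B}$ or $\mathfrak{B} \gg \mathfrak{A}$ if there
exists a constant $c > 0$ such that the corresponding sequences $(a_k)_{k \ge 1}$ and $(b_k)_{k \ge 1}$ satisfy $a_k \le c\,b_k$
for each sufficiently large index $k$. We write $\mathfrak{A} \asymp \mathfrak{B}$ if we have both $\mathfrak{A} \ll \mathfrak{B}$ and $\mathfrak{A} \gg \mathfrak{B}$.
The latter is an equivalence relation on $\mathfrak{S}_+$, and the condition E1 in §2.2 can be expressed
in the form

$|\mathfrak{X}_0^{(i+1)}| \asymp |\mathfrak{X}_0^{(i)}|^{\gamma}$

for each $i \in \mathbb{Z}$, where the absolute value and exponentiation are taken "component-wise".
In particular, for any $d \in \mathbb{N}$, $\alpha = m + n/\gamma \in E_d$ and $j \in \{0, 1, \dots, 2s_d(\alpha)\}$, the element $\mathfrak{M}_{\alpha,j}$
of $\mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]_{\le d}$ provided by Lemma 2.5 satisfies

$|\mathfrak{M}_{\alpha,j}| \asymp |\mathfrak{X}_0^{(0)}|^{m}\, |\mathfrak{X}_0^{(-1)}|^{n} \asymp |\mathfrak{X}_0^{(0)}|^{m+n/\gamma} = |\mathfrak{X}_0^{(0)}|^{\alpha}.$

Since each $E_d$ is a finite set of non-negative real numbers, this leads to the following conclusion.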

Lemma 2.6. Let $d \in \mathbb{N}$, let $r_{\alpha,j}$ ($\alpha \in E_d$, $0 \le j \le 2s_d(\alpha)$) be rational numbers not all zero,
let $\alpha' = m + n/\gamma$ be the largest element of $E_d$ for which at least one of the numbers $r_{\alpha',j}$
($0 \le j \le 2s_d(\alpha')$) is non-zero, and put $s' = s_d(\alpha')$. Then, with the notation of Lemma 2.5,
the linear combination $\mathfrak{A} = \sum_{\alpha \in E_d} \sum_{j=0}^{2s_d(\alpha)} r_{\alpha,j}\, \mathfrak{M}_{\alpha,j}$ satisfies

$\mathfrak{A} \sim \Bigl(\sum_{j=0}^{2s'} r_{\alpha',j}\, \xi^{j}\Bigr)\, \theta^{m+n-s'}\, (\mathfrak{X}_0^{(0)})^m (\mathfrak{X}_0^{(-1)})^n$ and $|\mathfrak{A}| \asymp |\mathfrak{X}_0^{(0)}|^{\alpha'}$.

We are now ready to complete the first step of the programme outlined at the end of
Subsection 2.4.

Lemma 2.7. Let $d \in \mathbb{N}$. The elements $\mathfrak{M}_{\alpha,j}$ ($\alpha \in E_d$, $j = 0, \dots, 2s_d(\alpha)$) constructed
in Lemma 2.5 form a $\mathbb{Q}$-linearly independent subset of $\mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]_{\le d}$ with cardinality
$(4d^3 + 6d^2 + 8d + 3)/3$.

Proof. Lemma 2.6 shows that the elements $\mathfrak{M}_{\alpha,j}$ are linearly independent over $\mathbb{Q}$. By
Theorem 2.1, their number is

$\sum_{\alpha \in E_d} (2s_d(\alpha) + 1) = \sum_{s=0}^{d} \chi_d(s)(2s + 1) = (d + 1)(2d + 1) + \sum_{s=0}^{d-1} (2s + 1)^2.$

□
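
The closing count can be double-checked numerically. The short sketch below (illustrative only; the function names are hypothetical) sums χ_d(s)(2s + 1) with the values of χ_d(s) from Theorem 2.1 and compares the result with the closed form (4d³ + 6d² + 8d + 3)/3 of Lemma 2.7, using exact rational arithmetic.

```python
from fractions import Fraction


def chi(d, s):
    """Values of chi_d(s) given by Theorem 2.1."""
    if s < d:
        return 2 * s + 1
    return d + 1 if s == d else 0


def count_basis(d):
    """Sum of chi_d(s) * (2s + 1) over s = 0, ..., d."""
    return sum(chi(d, s) * (2 * s + 1) for s in range(d + 1))


def closed_form(d):
    """The cardinality (4d^3 + 6d^2 + 8d + 3)/3 stated in Lemma 2.7."""
    return Fraction(4 * d**3 + 6 * d**2 + 8 * d + 3, 3)
```

For example, `count_basis(2)` and `closed_form(2)` both equal 25.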

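2.6. Computation of a Hilbert function. We introduce a new variable $U$, make the
ring $\mathbb{Q}[\mathbf{X}, \mathbf{X}^*, U]$ into a graded ring for the total degree, and denote by $I_1$ the homogeneous
ideal of this ring generated by

(17)    $\det(\mathbf{X}) - U^2, \quad \det(\mathbf{X}^*) - U^2, \quad \Phi(\mathbf{X}, \mathbf{X}^*)$.

Then, for each $d \in \mathbb{N}$, Lemma 2.4 ensures that we have a surjective $\mathbb{Q}$-linear map

(18)    $\bigl(\mathbb{Q}[\mathbf{X}, \mathbf{X}^*, U]/I_1\bigr)_d \longrightarrow \mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]_{\le d}, \qquad P(\mathbf{X}, \mathbf{X}^*, U) + I_1 \longmapsto P(\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}, 1)$.

In particular, this gives

(19)    $\dim_{\mathbb{Q}} \mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]_{\le d} \le H(I_1; d) := \dim_{\mathbb{Q}} \bigl(\mathbb{Q}[\mathbf{X}, \mathbf{X}^*, U]/I_1\bigr)_d$.

Thus, in order to complete the programme outlined at the end of Subsection 2.4, it
remains to compute the Hilbert function $H(I_1; d)$ of $I_1$. We achieve this by showing first
that the generators (17) of $I_1$ form a regular sequence in $\mathbb{Q}[\mathbf{X}, \mathbf{X}^*, U]$.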

Recall that a regular sequence in a ring $R$ is a finite sequence of elements $a_1, \dots, a_n$ of $R$
such that, for $i = 1, \dots, n$, the multiplication by $a_i$ in $R/(a_1, \dots, a_{i-1})$ is injective (with the
convention that $(a_1, \dots, a_{i-1}) = (0)$ for $i = 1$). If $R$ is a polynomial ring in $m$ variables over
a field, then, for any integer $n$ with $1 \le n \le m$, a sequence of $n$ homogeneous polynomials
$a_1, \dots, a_n$ of $R$ is regular if and only if the ideal $(a_1, \dots, a_n)$ that it generates has rank (or
codimension) equal to $n$. In that case, any permutation of $a_1, \dots, a_n$ is a regular sequence.

Lemma 2.8. The polynomials $\det(\mathbf{X})$, $\det(\mathbf{X}^*)$ and $\Phi(\mathbf{X}, \mathbf{X}^*)$ form a regular sequence in
$\mathbb{Q}[\mathbf{X}, \mathbf{X}^*]$.

Proof. Put $R := \mathbb{Q}[\mathbf{X}, \mathbf{X}^*]$. Since $\det(\mathbf{X})$ and $\det(\mathbf{X}^*)$ are relatively prime, they form a
regular sequence in $R$. Moreover, the ideal that they generate is the kernel of the endomorphism
of $R$ which maps $X_i$ to $X_0^{2-i} X_1^{i}$ and $X_i^*$ to $(X_0^*)^{2-i} (X_1^*)^{i}$ for $i = 0, 1, 2$. The conclusion
follows by observing that the image of $\Phi(\mathbf{X}, \mathbf{X}^*)$ under this map is

$(X_1 X_0^* - X_0 X_1^*)(a_{1,1} X_0 X_0^* + a_{1,2} X_0 X_1^* + a_{2,1} X_1 X_0^* + a_{2,2} X_1 X_1^*),$

which is a non-zero polynomial. □
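
The factorisation used at the end of this proof can be spot-checked by substituting integers. In the sketch below, Φ is taken as the combination of 2×2 determinants from (14), the endomorphism is applied by replacing each triple with (x0², x0·x1, x1²), and the result is compared with the factored expression. The helper names are hypothetical, and agreement at random integer points is evidence rather than a proof.

```python
import random


def det2(a, b, c, d):
    """Determinant of the 2x2 matrix with rows (a, b) and (c, d)."""
    return a * d - b * c


def phi(M, X, Xs):
    """Phi(X, X*) as the combination of 2x2 determinants in (14)."""
    (a11, a12), (a21, a22) = M
    x0, x1, x2 = X
    y0, y1, y2 = Xs
    return (a11 * det2(y0, y1, x0, x1) + a12 * det2(y1, y2, x0, x1)
            + a21 * det2(y0, y1, x1, x2) + a22 * det2(y1, y2, x1, x2))


def substitute(t0, t1):
    """Image of a triple under X_i -> X_0^(2-i) X_1^i, i = 0, 1, 2."""
    return (t0 * t0, t0 * t1, t1 * t1)


def factored_image(M, x0, x1, y0, y1):
    """The factored form of the image of Phi given in the proof of Lemma 2.8."""
    (a11, a12), (a21, a22) = M
    return (x1 * y0 - x0 * y1) * (a11 * x0 * y0 + a12 * x0 * y1
                                  + a21 * x1 * y0 + a22 * x1 * y1)


def random_check(trials=200, seed=1):
    """Compare both expressions at random integer points."""
    rng = random.Random(seed)
    for _ in range(trials):
        M = tuple(tuple(rng.randint(-9, 9) for _ in range(2)) for _ in range(2))
        x0, x1, y0, y1 = (rng.randint(-9, 9) for _ in range(4))
        image = phi(M, substitute(x0, x1), substitute(y0, y1))
        if image != factored_image(M, x0, x1, y0, y1):
            return False
    return True
```

The same substitution sends the determinant x0·x2 − x1² to zero, which is the reason the two determinants lie in the kernel of the endomorphism.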

We can now turn to the ideal $I_1$.

Lemma 2.9. The generators (17) of $I_1$ form a regular sequence in $\mathbb{Q}[\mathbf{X}, \mathbf{X}^*, U]$. For each
$d \in \mathbb{N}$, we have

$H(I_1; d) = (4d^3 + 6d^2 + 8d + 3)/3.$

Proof. Since the natural isomorphism $\mathbb{Q}[\mathbf{X}, \mathbf{X}^*, U]/(U) \to \mathbb{Q}[\mathbf{X}, \mathbf{X}^*]$ induced by the
specialization $U \mapsto 0$ maps the sequence of polynomials (17) to the regular sequence of $R$ studied
in Lemma 2.8, we deduce that $U$, $\det(\mathbf{X}) - U^2$, $\det(\mathbf{X}^*) - U^2$ and $\Phi(\mathbf{X}, \mathbf{X}^*)$ form a regular
sequence in $\mathbb{Q}[\mathbf{X}, \mathbf{X}^*, U]$. Since these are homogeneous polynomials, it follows that the last
three of them, which generate $I_1$, form a regular sequence. Since the latter are homogeneous
of degree 2 and since $\mathbb{Q}[\mathbf{X}, \mathbf{X}^*, U]$ is a polynomial ring in 7 variables, the Hilbert series of
the ideal $I_1$ is given by

$\sum_{d=0}^{\infty} H(I_1; d)\, T^d = \frac{(1 - T^2)^3}{(1 - T)^7} = (1 + T)^3 \sum_{d=0}^{\infty} \binom{d+3}{3} T^d,$

and a short computation completes the proof. □
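
The "short computation" can be carried out mechanically. For three homogeneous generators of degree 2 forming a regular sequence in a polynomial ring in 7 variables, the Hilbert series is (1 − T²)³/(1 − T)⁷ = (1 + T)³/(1 − T)⁴. The sketch below (illustrative only; the function name is hypothetical) expands this series and compares its coefficients with the formula of Lemma 2.9.

```python
from math import comb


def hilbert_coefficients(n_terms):
    """First coefficients of (1 + T)^3 / (1 - T)^4 as a power series in T."""
    numerator = [1, 3, 3, 1]  # coefficients of (1 + T)^3
    # The coefficient of T^e in 1/(1 - T)^4 is binomial(e + 3, 3).
    return [sum(c * comb(d - k + 3, 3)
                for k, c in enumerate(numerator) if k <= d)
            for d in range(n_terms)]
```

The first values are 1, 7, 25, 63, which agree with (4d³ + 6d² + 8d + 3)/3 for d = 0, 1, 2, 3.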

2.7. Conclusion. Combining the above result with (19) and Lemma 2.7, we obtain finally:

Theorem 2.10. Let $d \in \mathbb{N}$. Then, the map (18) is an isomorphism of vector spaces over $\mathbb{Q}$,
and the elements $\mathfrak{M}_{\alpha,j}$ ($\alpha \in E_d$, $j = 0, \dots, 2s_d(\alpha)$) constructed in Lemma 2.5 form a basis
of $\mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]_{\le d}$. The dimension of the latter vector space is $(4d^3 + 6d^2 + 8d + 3)/3$.

Applying first Lemma 2.6 and then the growth estimates of Subsection 2.5, we deduce
from this the following two consequences.

Corollary 2.11. Let $d \in \mathbb{N}$. For each non-zero element $\mathfrak{A}$ of $\mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]_{\le d}$, there exists a
point $\alpha = m + n/\gamma \in E_d$ and a polynomial $A \in \mathbb{Q}[T]$ of degree at most $2s_d(\alpha)$ such that

$\mathfrak{A} \sim \theta^{m+n-s_d(\alpha)}\, A(\xi)\, (\mathfrak{X}_0^{(0)})^m (\mathfrak{X}_0^{(-1)})^n.$

Corollary 2.12. For each non-zero element $\mathfrak{A}$ of $\mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]$, there exists a point $\alpha \in \mathbb{Z}[\gamma]$
such that

$|\mathfrak{A}| \asymp |\mathfrak{X}_0^{(0)}|^{\alpha}.$

The map $\mathfrak{A} \mapsto \alpha$ is a rank two valuation on the ring $\mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]$.

Finally, we note that, for each $d \in \mathbb{N}$, the linear map (18) factors through the map from
$(\mathbb{Q}[\mathbf{X}, \mathbf{X}^*]_{\le d} + I)/I$ to $\mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]_{\le d}$ induced by $\pi$. Since the former is an isomorphism,
the latter is also an isomorphism, and so:

Corollary 2.13. The ideal $I$ defined in Lemma 2.4 is the kernel of the evaluation map $\pi$
from $\mathbb{Q}[\mathbf{X}, \mathbf{X}^*]$ to $\mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]$.

3. Analogous results in bi-degree 

The following result is analogous to Theorem 2.1.

Theorem 3.1. Let $\mathbf{d} = (d_1, d_2) \in \mathbb{N}^2$. For each $s \in \mathbb{N}$, let $\chi_{\mathbf{d}}(s) = \chi_{d_1,d_2}(s)$ denote the
number of points $(m, n) \in \mathbb{Z}^2$ for which the conditions

(20)    $(m, n) = \sum_{k=1}^{s} \bigl(f(-i_k),\, f(-i_k - 1)\bigr), \quad \sum_{k=1}^{s} f(i_k - 2) \le d_1$ and $\sum_{k=1}^{s} f(i_k - 1) \le d_2$

admit a solution in integers $0 \le i_1 \le \cdots \le i_s$, and $s$ is maximal with this property. Then,

(21)    $\chi_{d_1,d_2}(s) = \begin{cases} 2\min\{d_1,\, d_2,\, s,\, d_1 + d_2 - s\} + 1 & \text{if } 0 \le s \le d_1 + d_2, \\ 0 & \text{if } s > d_1 + d_2. \end{cases}$

Note that this function $\chi_{d_1,d_2}(s)$ possesses several symmetries. For each $(d_1, d_2) \in \mathbb{N}^2$ and
each $s = 0, \dots, d_1 + d_2$, it satisfies

$\chi_{d_1,d_2}(s) = \chi_{d_2,d_1}(s)$ and $\chi_{d_1,d_2}(s) = \chi_{d_1,d_2}(d_1 + d_2 - s)$.
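
As with Theorem 2.1, the bi-degree count can be checked by exhaustive enumeration for small (d1, d2). The sketch below is only an illustration with hypothetical function names. It uses the fact that each index i ≥ 0 contributes f(i−2) + f(i−1) = f(i) ≥ 1 to the sum of the two partial degrees, so that s ≤ d1 + d2 and only indices with f(i) ≤ d1 + d2 can occur.

```python
from collections import Counter
from itertools import combinations_with_replacement


def fib(i):
    """f(0) = f(1) = 1, extended to all integers by f(i+2) = f(i+1) + f(i)."""
    a, b = 1, 1  # (f(0), f(1))
    if i >= 0:
        for _ in range(i):
            a, b = b, a + b
        return a
    for _ in range(-i):
        a, b = b - a, a  # step back: f(j-1) = f(j+1) - f(j)
    return a


def chi_bidegree_bruteforce(d1, d2):
    """Count the points (m, n) of condition (20) according to their maximal size s."""
    total = d1 + d2
    indices = [i for i in range(total + 1) if fib(i) <= total]
    best = {(0, 0): 0}  # the empty sum represents (0, 0) with s = 0
    for s in range(1, total + 1):
        for seq in combinations_with_replacement(indices, s):
            if (sum(fib(i - 2) for i in seq) <= d1
                    and sum(fib(i - 1) for i in seq) <= d2):
                point = (sum(fib(-i) for i in seq),
                         sum(fib(-i - 1) for i in seq))
                best[point] = max(best.get(point, 0), s)
    return Counter(best.values())


def chi_bidegree_formula(d1, d2, s):
    """Right-hand side of (21)."""
    if 0 <= s <= d1 + d2:
        return 2 * min(d1, d2, s, d1 + d2 - s) + 1
    return 0
```

For (d1, d2) = (2, 1), the enumeration gives the counts 1, 3, 3, 1 for s = 0, 1, 2, 3, which also illustrates the symmetry under s ↦ d1 + d2 − s.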
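As in Section 2, we note that, for a point $\alpha = m + n/\gamma \in \mathbb{Z}[\gamma]$, the conditions (20) are
equivalent to

(22)    $\alpha = \sum_{k=1}^{s} \gamma^{-i_k}, \quad \sum_{k=1}^{s} f(i_k - 2) \le d_1$ and $\sum_{k=1}^{s} f(i_k - 1) \le d_2$.

For each $\mathbf{d} \in \mathbb{N}^2$, we denote by $E_{\mathbf{d}}$ the set of $\alpha \in \mathbb{Z}[\gamma]$ for which these conditions admit a
solution in integers $0 \le i_1 \le \cdots \le i_s$ for some $s \in \mathbb{N}$ and, for each $\alpha \in E_{\mathbf{d}}$, we denote by
$s_{\mathbf{d}}(\alpha)$ the largest such $s$. Then, Theorem 3.1 tells us that, for given $\mathbf{d} \in \mathbb{N}^2$ and $s \in \mathbb{N}$, the
number of elements $\alpha$ of $E_{\mathbf{d}}$ with $s_{\mathbf{d}}(\alpha) = s$ is $\chi_{\mathbf{d}}(s)$ given by (21). This will be proved in
Section 4.

For each $\mathbf{d} = (d_1, d_2) \in \mathbb{N}^2$, we also denote by $\mathbb{Q}[\mathbf{X}, \mathbf{X}^*]_{\le \mathbf{d}}$ the set of polynomials of
$\mathbb{Q}[\mathbf{X}, \mathbf{X}^*]$ with degree at most $d_1$ in $\mathbf{X}$ and degree at most $d_2$ in $\mathbf{X}^*$. We also write
$\mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]_{\le \mathbf{d}}$ for the image of that set under the evaluation map $\pi$ defined by (13).
We can now state and prove the following bi-degree analog of Theorem 2.10.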

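Theorem 3.2. Let $\mathbf{d} = (d_1, d_2) \in \mathbb{N}^2$. For each $\alpha = m + n/\gamma \in E_{\mathbf{d}}$ and each
$j = 0, \dots, 2s_{\mathbf{d}}(\alpha)$, there exists an element $\mathfrak{M}'_{\alpha,j}$ of $\mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]_{\le \mathbf{d}}$ with

$\mathfrak{M}'_{\alpha,j} \sim \xi^{j}\, \theta^{m+n-s_{\mathbf{d}}(\alpha)}\, (\mathfrak{X}_0^{(0)})^m (\mathfrak{X}_0^{(-1)})^n.$

Any such choice of elements, one for each pair $(\alpha, j)$, provides a basis of $\mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]_{\le \mathbf{d}}$.
This vector space has dimension $(d_1 + d_2 + 1)(2d_1d_2 + d_1 + d_2 + 1)$.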

Proof. The existence of the elements $\mathfrak{M}'_{\alpha,j}$ is established exactly as in the proof of Lemma
2.5 upon replacing everywhere the symbol $d$ by $\mathbf{d}$, using (22) and (20) instead of (6) and
(3). Fix such a choice of elements. The fact that they are linearly independent over $\mathbb{Q}$ is
proved as in §2.5, upon observing that the statement of Lemma 2.6 still holds when $d$ is
replaced by $\mathbf{d}$ and $\mathfrak{M}_{\alpha,j}$ by $\mathfrak{M}'_{\alpha,j}$. According to Theorem 3.1, they form a set of cardinality
$\sum_{s=0}^{d_1+d_2} \chi_{\mathbf{d}}(s)(2s + 1)$. Since $\chi_{\mathbf{d}}(s) = \chi_{\mathbf{d}}(d_1 + d_2 - s)$ for $s = 0, \dots, d_1 + d_2$, this cardinality
is also given by

$\frac{1}{2} \sum_{s=0}^{d_1+d_2} \Bigl(\chi_{\mathbf{d}}(s)(2s + 1) + \chi_{\mathbf{d}}(s)\bigl(2(d_1 + d_2 - s) + 1\bigr)\Bigr) = (d_1 + d_2 + 1) \sum_{s=0}^{d_1+d_2} \chi_{\mathbf{d}}(s).$

A short computation based on the formula (21) shows that the right-most sum is equal to
$2d_1d_2 + d_1 + d_2 + 1$ (an alternative approach is to note that this sum is the cardinality of
$E_{\mathbf{d}}$ and to use Corollary 4.5). Thus the elements $\mathfrak{M}'_{\alpha,j}$ span a subspace of $\mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]_{\le \mathbf{d}}$ of
dimension $(d_1 + d_2 + 1)(2d_1d_2 + d_1 + d_2 + 1)$. To complete the proof, it remains only to show
that the dimension of $\mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]_{\le \mathbf{d}}$ is no more than this. To that end, we proceed as in
§2.6. We introduce two new indeterminates $V$ and $V^*$ and, for each $\mathbf{n} = (n_1, n_2) \in \mathbb{N}^2$, we
denote by $\mathbb{Q}[\mathbf{X}, V, \mathbf{X}^*, V^*]_{\mathbf{n}}$ the subspace of $\mathbb{Q}[\mathbf{X}, V, \mathbf{X}^*, V^*]$ whose elements are homogeneous
in $(\mathbf{X}, V)$ of degree $n_1$ and homogeneous in $(\mathbf{X}^*, V^*)$ of degree $n_2$. This makes the polynomial
ring $R_2 := \mathbb{Q}[\mathbf{X}, V, \mathbf{X}^*, V^*]$ into an $\mathbb{N}^2$-graded ring. Let $I_2$ denote the bi-homogeneous ideal of
$R_2$ generated by

(23)    $\det(\mathbf{X}) - V^2, \quad \det(\mathbf{X}^*) - (V^*)^2, \quad \Phi(\mathbf{X}, \mathbf{X}^*)$.

Lemma 2.4 ensures that we have a surjective $\mathbb{Q}$-linear map in each bi-degree $\mathbf{n}$

(24)    $\bigl(\mathbb{Q}[\mathbf{X}, V, \mathbf{X}^*, V^*]/I_2\bigr)_{\mathbf{n}} \longrightarrow \mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]_{\le \mathbf{n}}, \qquad P(\mathbf{X}, V, \mathbf{X}^*, V^*) + I_2 \longmapsto P(\mathfrak{X}^{(0)}, 1, \mathfrak{X}^{(-1)}, 1)$.

In particular, this gives $\dim_{\mathbb{Q}} \mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]_{\le \mathbf{d}} \le H(I_2; \mathbf{d})$ where $H(I_2; \mathbf{n})$ stands for the
Hilbert function of $I_2$ at $\mathbf{n}$, namely the dimension of the domain of the linear map (24). As
in the proof of Lemma 2.9, we deduce from Lemma 2.8 that the generators (23) of $I_2$ form a
regular sequence in $R_2$. Since these generators are bi-homogeneous of bi-degree $(2, 0)$, $(0, 2)$
and $(1, 1)$, and since the grading of $R_2$ involves two sets of 4 variables, we deduce that the
Hilbert series of $I_2$ is

$\sum_{\mathbf{n} \in \mathbb{N}^2} H(I_2; \mathbf{n})\, T_1^{n_1} T_2^{n_2} = \frac{(1 - T_1^2)(1 - T_2^2)(1 - T_1T_2)}{(1 - T_1)^4 (1 - T_2)^4} = (1 - T_1T_2) \sum_{n_1, n_2 \in \mathbb{N}} (n_1 + 1)^2 (n_2 + 1)^2\, T_1^{n_1} T_2^{n_2}.$

This completes the proof as it implies that

$H(I_2; \mathbf{d}) = (d_1 + 1)^2 (d_2 + 1)^2 - d_1^2 d_2^2 = (d_1 + d_2 + 1)(2d_1d_2 + d_1 + d_2 + 1).$

□
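
The arithmetic at the end of this proof can be verified directly. The sketch below (illustrative only, with hypothetical function names) checks three things: that the coefficient of Tⁿ in (1 − T²)/(1 − T)⁴ is (n + 1)², that the resulting value of the Hilbert function factors as stated, and that it agrees with the count of basis elements obtained from Theorem 3.1.

```python
from math import comb


def one_variable_coefficient(n):
    """Coefficient of T^n in (1 - T^2)/(1 - T)^4."""
    # 1/(1 - T)^4 has coefficients binomial(e + 3, 3); comb returns 0 when n + 1 < 3.
    return comb(n + 3, 3) - comb(n + 1, 3)


def hilbert_bigraded(d1, d2):
    """Coefficient of T1^d1 T2^d2 in (1 - T1 T2) * sum (n1+1)^2 (n2+1)^2 T1^n1 T2^n2."""
    def square(n):
        return (n + 1) ** 2 if n >= 0 else 0
    return square(d1) * square(d2) - square(d1 - 1) * square(d2 - 1)


def basis_count(d1, d2):
    """Sum of chi_{d1,d2}(s) * (2s + 1), with chi given by formula (21)."""
    return sum((2 * min(d1, d2, s, d1 + d2 - s) + 1) * (2 * s + 1)
               for s in range(d1 + d2 + 1))
```

For (d1, d2) = (1, 1) all three quantities give 15.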

Note that this result implies that the statement of Corollary 2.11 still holds in bi-degree,
with $d$ replaced by $\mathbf{d}$.

4. Combinatorial study 

This section is devoted to the proof of Theorems 2.1 and 3.1. As mentioned in §2.1, we
work within the ring $\mathbb{Z}[\gamma]$. We define

$E = \{\alpha \in \mathbb{Z}[\gamma] \,;\, \alpha \ge 0\}$ and $E^* = E \setminus \{0\}$,

and note that, since $\gamma > 0$, the sets $E_d$ and $E_{\mathbf{d}}$ defined respectively in Sections 2.1 and 3 are
subsets of $E$. Our first goal is to provide a more explicit description of these.

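4.1. A partition. We first establish a partition of $E^*$.

Proposition 4.1. The sets

$E^{(+)} = \{m + n\gamma^{-1} \,;\, m, n \ge 1\}$
and $E^{(i)} = \{m\gamma^{-i} + n\gamma^{-i-2} \,;\, m \ge 1,\ n \ge 0\}$ for $i \ge 0$

form a partition $E^* = E^{(+)} \sqcup \bigl(\bigsqcup_{i=0}^{\infty} E^{(i)}\bigr)$ of $E^*$.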

Proof. Consider the bijection $\varphi\colon \mathbb{Z}[\gamma] \to \mathbb{Z}^2$ which maps a point $m + n/\gamma$ to its pair of
coordinates $(m, n)$ relative to the basis $\{1, 1/\gamma\}$ of $\mathbb{Z}[\gamma]$. It identifies $E^*$ with the set of
non-zero points $(m, n)$ of $\mathbb{Z}^2$ whose argument in polar coordinates satisfies

$-\arctan(\gamma) < \arg(m, n) < \pi - \arctan(\gamma).$

Using the formulas (5), a quick recurrence argument shows that, for each index $i \ge 0$, the
determinant of the points $\varphi(\gamma^{-i})$ and $\varphi(\gamma^{-i-2})$ is

$\begin{vmatrix} f(i-2) & -f(i-1) \\ f(i) & -f(i+1) \end{vmatrix} = (-1)^{i+1}.$

This means that $\{\varphi(\gamma^{-i}), \varphi(\gamma^{-i-2})\}$ forms a basis of $\mathbb{Z}^2$ for each $i \ge 0$. Since the points
$\varphi(\gamma^{-2i})$ have positive first coordinate, it also means that $\arg \varphi(\gamma^{-2i})$ is a strictly decreasing
function of $i \ge 0$ starting from $\arg \varphi(\gamma^{0}) = 0$. Finally, since the points $\varphi(\gamma^{-2i-1})$ have
positive second coordinate, it tells us that $\arg \varphi(\gamma^{-2i-1})$ is a strictly increasing function of
$i \ge 0$ starting from $\arg \varphi(\gamma^{-1}) = \pi/2$. In other words, we have

$\cdots < \arg \varphi(\gamma^{-4}) < \arg \varphi(\gamma^{-2}) < \arg \varphi(\gamma^{0}) < \arg \varphi(\gamma^{-1}) < \arg \varphi(\gamma^{-3}) < \arg \varphi(\gamma^{-5}) < \cdots$

We conclude from this that a point $\alpha$ of $E^*$ belongs to $E^{(+)}$ if and only if $\arg \varphi(\gamma^{0}) <
\arg \varphi(\alpha) < \arg \varphi(\gamma^{-1})$, and that it belongs to $E^{(i)}$ for some $i \ge 0$ if and only if $\arg \varphi(\alpha)$
lies between $\arg \varphi(\gamma^{-i})$ and $\arg \varphi(\gamma^{-i-2})$, with the first end point included and the second
excluded. In particular the sets $E^{(+)}$ and $E^{(i)}$ with $i \ge 0$ are all disjoint. They cover $E^*$
because the fact that $\lim_{j \to \infty} f(j)/f(j-1) = \gamma$ implies that $\arg \varphi(\gamma^{-2i})$ and $\arg \varphi(\gamma^{-2i-1})$
tend respectively to $-\arctan(\gamma)$ and $\pi - \arctan(\gamma)$ as $i \to \infty$. □
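
Proposition 4.1 lends itself to a direct computational check on a small window of points. The sketch below is an illustration only, with hypothetical function names. It tests positivity of m + n/γ exactly, by rewriting it as ((2m − n) + n√5)/2, and finds the coordinates of (m, n) in each basis formed by the images of γ⁻ⁱ and γ⁻ⁱ⁻² by Cramer's rule, which involves no division because the determinant of the basis is ±1.

```python
def fib(i):
    """f(0) = f(1) = 1, extended to all integers by f(i+2) = f(i+1) + f(i)."""
    a, b = 1, 1  # (f(0), f(1))
    if i >= 0:
        for _ in range(i):
            a, b = b, a + b
        return a
    for _ in range(-i):
        a, b = b - a, a  # step back: f(j-1) = f(j+1) - f(j)
    return a


def is_positive(m, n):
    """Exact test for m + n/gamma > 0, i.e. (2m - n) + n*sqrt(5) > 0."""
    a, b = 2 * m - n, n
    if a >= 0 and b >= 0:
        return (a, b) != (0, 0)
    if a <= 0 and b <= 0:
        return False
    return a * a > 5 * b * b if a > 0 else 5 * b * b > a * a


def classify(m, n, max_index=40):
    """Labels of all sets E^(+) ('+') and E^(i) (the integer i) containing m + n/gamma."""
    labels = []
    if m >= 1 and n >= 1:
        labels.append("+")
    for i in range(max_index):
        u = (fib(-i), fib(-i - 1))      # coordinates of gamma^(-i)
        v = (fib(-i - 2), fib(-i - 3))  # coordinates of gamma^(-i-2)
        det = u[0] * v[1] - u[1] * v[0]  # equals +1 or -1
        a = (m * v[1] - n * v[0]) * det  # coefficient of gamma^(-i)
        b = (u[0] * n - u[1] * m) * det  # coefficient of gamma^(-i-2)
        if a >= 1 and b >= 0:
            labels.append(i)
    return labels
```

For example, `classify(1, 1)` returns `['+']`, `classify(1, -1)` returns `[2]` since 1 − 1/γ = γ⁻², and every point with m + n/γ > 0 in the tested window receives exactly one label. The cut-off `max_index` is an assumption that is ample for small |m| and |n|.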

With our convention that an empty sum is zero, this implies that:

Corollary 4.2. Any $\alpha \in E$ can be written in the form

(25)    $\alpha = \gamma^{-i_1} + \cdots + \gamma^{-i_s}$

for a choice of integers $s \ge 0$ and $0 \le i_1 \le \cdots \le i_s$.

We say that a finite non-decreasing sequence of non-negative integers $\mathbf{i} = (i_1, \dots, i_s)$ is
a representation of a point $\alpha$ of $E$ if it satisfies the condition (25). In particular, the only
representation of the point $0$ is the empty sequence.

4.2. Degree and bi-degree. For any finite non-decreasing sequence of non-negative
integers $\mathbf{i} = (i_1, \dots, i_s)$, we define

$d(\mathbf{i}) = \sum_{k=1}^{s} f(i_k), \quad d_1(\mathbf{i}) = \sum_{k=1}^{s} f(i_k - 2), \quad d_2(\mathbf{i}) = \sum_{k=1}^{s} f(i_k - 1),$

$\mathbf{d}(\mathbf{i}) = (d_1(\mathbf{i}), d_2(\mathbf{i}))$ and $s(\mathbf{i}) = s$.

We say that $d(\mathbf{i})$, $\mathbf{d}(\mathbf{i})$ and $s(\mathbf{i})$ are respectively the degree, bi-degree and size of the point
$\mathbf{i}$, while $d_1(\mathbf{i})$ and $d_2(\mathbf{i})$ are respectively the first and second partial degrees of $\mathbf{i}$. For the
empty sequence, all these integers are zero. We also put a partial order on $\mathbb{N}^2$ by writing
$(m, n) \le (m', n')$ if $m \le m'$ and $n \le n'$. We can now state and prove:

Proposition 4.3. Let $\alpha = m + n/\gamma \in E^*$ and let $\mathbf{i} = (i_1, \dots, i_s)$ be a representation of $\alpha$.
Then we have $d(\mathbf{i}) \ge |m| + |n|$ and $\mathbf{d}(\mathbf{i}) \ge (|m|, |n|)$. Both inequalities are equalities if $i_s \le 1$
or if $i_1, \dots, i_s$ share the same parity. Otherwise, they become strict inequalities. Moreover,
we have $d_2(\mathbf{i}) > |n|$ if $i_1, \dots, i_s$ contains a pair of positive integers not of the same parity.

Proof. Since $\alpha = \gamma^{-i_1} + \cdots + \gamma^{-i_s}$, the formulas (5) imply that

$m = \sum_{k=1}^{s} (-1)^{i_k} f(i_k - 2)$ and $n = \sum_{k=1}^{s} (-1)^{i_k+1} f(i_k - 1).$

From this we deduce that

$|m| \le \sum_{k=1}^{s} f(i_k - 2) = d_1(\mathbf{i})$ and $|n| \le \sum_{k=1}^{s} f(i_k - 1) = d_2(\mathbf{i}),$

and the conclusion follows because $f(-2) = 1$, $f(-1) = 0$ and $f(i) \ge 1$ for each $i \in \mathbb{N}$. □

Since, by Proposition 4.1, each $\alpha \in E^*$ admits a representation $\mathbf{i} = (i_1, \dots, i_s)$ with $i_s \le 1$
or with $i_1, \dots, i_s$ of the same parity, we deduce that

Corollary 4.4. Each $\alpha = m + n/\gamma \in E$ admits a representation with smallest degree
$d(\alpha) := |m| + |n|$ and smallest bi-degree $\mathbf{d}(\alpha) := (|m|, |n|)$.

We say that the integers $d(\alpha)$ and $\mathbf{d}(\alpha)$ defined in the above corollary are respectively the
degree and bi-degree of $\alpha$.

Let $d \in \mathbb{N}$ and $\mathbf{d} = (d_1, d_2) \in \mathbb{N}^2$. In §2.1 (resp. §3), we defined $E_d$ (resp. $E_{\mathbf{d}}$) as the set
of points which admit a representation of degree $\le d$ (resp. of bi-degree $\le \mathbf{d}$). According
to the corollary, it can also be described as the set of elements of $E$ with degree $\le d$ (resp.
with bi-degree $\le \mathbf{d}$):

$E_d = \{m + n/\gamma \in E \,;\, |m| + |n| \le d\}$ and $E_{\mathbf{d}} = \{m + n/\gamma \in E \,;\, |m| \le d_1,\ |n| \le d_2\}$.

We can now easily compute the cardinality of these sets.

Corollary 4.5. Let $d \in \mathbb{N}$ and $\mathbf{d} = (d_1, d_2) \in \mathbb{N}^2$. Then, we have $|E_d| = d^2 + d + 1$ and
$|E_{\mathbf{d}}| = 2d_1d_2 + d_1 + d_2 + 1$.

Proof. Denote by $\mathcal{L}$ the set of all non-zero points $(m, n)$ in $\mathbb{Z}^2$ satisfying $|m| + |n| \le d$
(resp. $|m| \le d_1$ and $|n| \le d_2$). Define also $\mathcal{L}_+$ to be the set of points $(m, n)$ in $\mathcal{L}$ for which
$m + n/\gamma > 0$. Then, $E_d \setminus \{0\}$ (resp. $E_{\mathbf{d}} \setminus \{0\}$) is in bijection with $\mathcal{L}_+$. As the sets $\mathcal{L}_+$
and $-\mathcal{L}_+$ form a partition of $\mathcal{L}$ in two subsets of the same cardinality, it follows that the
cardinality of $E_d$ (resp. of $E_{\mathbf{d}}$) is $1 + |\mathcal{L}|/2$, and the conclusion follows upon noting that $|\mathcal{L}|$
is $2d(d + 1)$ (resp. $(2d_1 + 1)(2d_2 + 1) - 1$). □
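
These cardinalities are simple to confirm by listing lattice points. The sketch below (illustrative only; the function names are hypothetical) uses the description of the two sets as the points m + n/γ ≥ 0 with |m| + |n| ≤ d, respectively with |m| ≤ d1 and |n| ≤ d2, and an exact positivity test based on m + n/γ = ((2m − n) + n√5)/2.

```python
def is_positive(m, n):
    """Exact test for m + n/gamma > 0, i.e. (2m - n) + n*sqrt(5) > 0."""
    a, b = 2 * m - n, n
    if a >= 0 and b >= 0:
        return (a, b) != (0, 0)
    if a <= 0 and b <= 0:
        return False
    return a * a > 5 * b * b if a > 0 else 5 * b * b > a * a


def count_total_degree(d):
    """Number of points m + n/gamma >= 0 with |m| + |n| <= d."""
    return sum(1 for m in range(-d, d + 1) for n in range(-d, d + 1)
               if abs(m) + abs(n) <= d
               and ((m, n) == (0, 0) or is_positive(m, n)))


def count_bidegree(d1, d2):
    """Number of points m + n/gamma >= 0 with |m| <= d1 and |n| <= d2."""
    return sum(1 for m in range(-d1, d1 + 1) for n in range(-d2, d2 + 1)
               if (m, n) == (0, 0) or is_positive(m, n))
```

For instance, `count_total_degree(3)` returns 13 = 3² + 3 + 1 and `count_bidegree(2, 1)` returns 8 = 2·2·1 + 2 + 1 + 1.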

4.3. Representations by quads. Let $\alpha \in E$. For each $d \in \mathbb{N}$ such that $\alpha \in E_d$, we define
the size $s_d(\alpha)$ of $\alpha$ relative to $d$ to be the largest size of a representation of $\alpha$ of degree $\le d$
(see §2.ip). Similarly, for each $\mathbf{d} \in \mathbb{N}^2$ such that $\alpha \in E_{\mathbf{d}}$, we define the size $s_{\mathbf{d}}(\alpha)$ of $\alpha$ relative
to $\mathbf{d}$ to be the largest size of a representation of $\alpha$ of bi-degree $\le \mathbf{d}$ (see §3). The next
proposition shows that, in order to compute the various degrees and sizes of $\alpha$, it suffices to
consider only representations of the form

\[ \alpha = a\gamma^{-i} + b\gamma^{-i-1} + c\gamma^{-i-2} \tag{26} \]

with $i, a, b, c \in \mathbb{N}$, and $a \ge 1$ if $\alpha \ne 0$.

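Proposition 4.6. Let $d \in \mathbb{N}^*$ and let $\mathbf{d} \in \mathbb{N}^2 \setminus \{(0,0)\}$. Each $\alpha \in E_d \setminus \{0\}$ admits a
representation $\mathbf{i} = (i_1, \dots, i_s)$ with degree $d(\mathbf{i}) \le d$ and size $s = s_d(\alpha)$ for which $i_s \le i_1 + 2$.
Similarly, each $\alpha \in E_{\mathbf{d}} \setminus \{0\}$ admits a representation $\mathbf{i} = (i_1, \dots, i_s)$ with bi-degree $\mathbf{d}(\mathbf{i}) \le \mathbf{d}$
and size $s = s_{\mathbf{d}}(\alpha)$ for which $i_s \le i_1 + 2$.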

Proof. Let $\alpha \in E_d \setminus \{0\}$. Put $s = s_d(\alpha)$, and choose a representation $\mathbf{i} = (i_1, \dots, i_s)$ of $\alpha$ of
size $s$ with minimal degree. We claim that $\mathbf{i}$ has all the required properties. First it satisfies
$d(\mathbf{i}) \le d$ by definition of $s_d(\alpha)$. It remains to show that $i_s \le i_1 + 2$.

To show this, we first observe that, for any pair of integers $(p, k)$ with $k \ge 1$, we have

\[ \gamma^{-p} + \gamma^{-p-2k-1} = \sum_{j=1}^{k} \gamma^{-p-2j+1} + \gamma^{-p-2k+1}, \]
\[ \gamma^{-p} + \gamma^{-p-2k-2} = \gamma^{-p-2} + \sum_{j=1}^{k} \gamma^{-p-2j} + \gamma^{-p-2k}. \]

Assuming that $i_s \ge i_1 + 3$, these formulas show that the point $\beta = \gamma^{-i_1} + \gamma^{-i_s}$ admits
a representation $\mathbf{j} = (j_1, \dots, j_t)$ with coordinates of the same parity as $i_s$, size $t = 2$ if
$i_s = i_1 + 3$, and size $t \ge 3$ if $i_s > i_1 + 3$. In this case, Proposition 4.3 gives $d(\mathbf{j}) = d(\beta)$ and
also $d(\beta) \le d(i_1, i_s)$ with the strict inequality if $i_s = i_1 + 3$. Then, upon reorganizing terms
in the decomposition

\[ \alpha = (\gamma^{-j_1} + \cdots + \gamma^{-j_t}) + (\gamma^{-i_2} + \cdots + \gamma^{-i_{s-1}}), \]

we get a representation $\mathbf{i}'$ of $\alpha$ with degree $d(\mathbf{i}') = d(\mathbf{i}) + d(\beta) - d(i_1, i_s)$ and size $s(\mathbf{i}') = s + t - 2$.
If $i_s = i_1 + 3$, we have $d(\mathbf{i}') < d(\mathbf{i})$ and $s(\mathbf{i}') = s$, in contradiction with the choice of $\mathbf{i}$. If
$i_s > i_1 + 3$, we find that $d(\mathbf{i}') \le d(\mathbf{i}) \le d$ and $s(\mathbf{i}') > s = s_d(\alpha)$, in contradiction with the
definition of $s_d(\alpha)$. Thus, we must have $i_s \le i_1 + 2$.

This proves the first assertion of the proposition. The proof of the second assertion is
the same provided that one replaces everywhere the word "degree" by "bi-degree", and the
symbol $d$ by $\mathbf{d}$. □
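The identities behind this step can be checked exactly by comparing coordinates in the basis $(1, 1/\gamma)$. The sketch below is illustrative: it assumes $f(i) = f(i-1) + f(i-2)$ with $f(-2) = 1$, $f(-1) = 0$, and it writes the odd-gap case as $\gamma^{-p} + \gamma^{-p-2k-1} = \sum_{j=1}^{k} \gamma^{-p-2j+1} + \gamma^{-p-2k+1}$, which is one possible way of stating it and should be read as an assumption that the code verifies.

```python
def f(i):
    """Assumed recurrence: f(-2) = 1, f(-1) = 0, f(i) = f(i-1) + f(i-2)."""
    a, b = 1, 0
    for _ in range(i + 2):
        a, b = b, a + b
    return a

def coords(exponents):
    """Coordinates (m, n) of the sum of gamma^{-e} over the given exponents."""
    m = sum((-1) ** e * f(e - 2) for e in exponents)
    n = sum((-1) ** (e + 1) * f(e - 1) for e in exponents)
    return m, n

def odd_gap(p, k):
    """Exponents replacing (p, p + 2k + 1); all have the parity of p + 1."""
    return [p + 2 * j - 1 for j in range(1, k + 1)] + [p + 2 * k - 1]

def even_gap(p, k):
    """Exponents replacing (p, p + 2k + 2); all have the parity of p."""
    return [p + 2] + [p + 2 * j for j in range(1, k + 1)] + [p + 2 * k]

def identities_hold(p, k):
    """True when both rewritings represent the same point as the original pair."""
    return (coords([p, p + 2 * k + 1]) == coords(odd_gap(p, k))
            and coords([p, p + 2 * k + 2]) == coords(even_gap(p, k)))
```

Since $1$ and $1/\gamma$ are linearly independent over $\mathbb{Q}$, equality of the integer coordinates is equivalent to equality of the real numbers. The new representation has size $k + 1$ in the odd case and $k + 2$ in the even case, which matches the sizes $t = 2$ and $t \ge 3$ used in the proof.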

We define a quad $q$ to be an expression of the form $q = (i ; a, b, c)$ with $i, a, b, c \in \mathbb{N}$ and
$a \ge 1$. We say that a quad $q$ as above represents a point $\alpha \in E$ if it satisfies (26). Identifying
it with the sequence formed by $a$ occurrences of $i$ followed by $b$ occurrences of $i + 1$ and $c$
occurrences of $i + 2$, the various notions of degree and size translate to

\[ \begin{gathered} d_1(q) = a f(i-2) + b f(i-1) + c f(i), \qquad d_2(q) = a f(i-1) + b f(i) + c f(i+1), \\ d(q) = d_1(q) + d_2(q), \qquad \mathbf{d}(q) = (d_1(q), d_2(q)) \quad\text{and}\quad s(q) = a + b + c. \end{gathered} \tag{27} \]

In this context, Proposition 4.6 shows that for any $\alpha \in E^*$ and any integer $d \ge d(\alpha)$ (resp.
any integer pair $\mathbf{d} \ge \mathbf{d}(\alpha)$), the integer $s_d(\alpha)$ (resp. $s_{\mathbf{d}}(\alpha)$) is the largest size of a quad of
degree $\le d$ (resp. of bi-degree $\le \mathbf{d}$) which represents $\alpha$.
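The degree, bi-degree and size of a quad are easy to compute. The following sketch is illustrative: a quad $(i ; a, b, c)$ is stored as a plain tuple `(i, a, b, c)`, the function names are hypothetical, and `f` is assumed to satisfy $f(-2) = 1$, $f(-1) = 0$, $f(i) = f(i-1) + f(i-2)$.

```python
def f(i):
    """Assumed recurrence: f(-2) = 1, f(-1) = 0, f(i) = f(i-1) + f(i-2)."""
    x, y = 1, 0
    for _ in range(i + 2):
        x, y = y, x + y
    return x

def quad_bidegree(q):
    """Bi-degree (d_1(q), d_2(q)) of the quad q = (i, a, b, c)."""
    i, a, b, c = q
    d1 = a * f(i - 2) + b * f(i - 1) + c * f(i)
    d2 = a * f(i - 1) + b * f(i) + c * f(i + 1)
    return d1, d2

def quad_degree(q):
    """Degree d(q) = d_1(q) + d_2(q)."""
    return sum(quad_bidegree(q))

def quad_size(q):
    """Size s(q) = a + b + c."""
    _, a, b, c = q
    return a + b + c

def quad_as_representation(q):
    """The sequence: a copies of i, then b copies of i+1, then c copies of i+2."""
    i, a, b, c = q
    return (i,) * a + (i + 1,) * b + (i + 2,) * c
```

By the recurrence for $f$, the degree can also be written $d(q) = a f(i) + b f(i+1) + c f(i+2)$; for the quad $(0 ; 2, 1, 3)$ both expressions give $9$.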

4.4. Sequences of quads. For each $\alpha \in E^*$, we denote by $Q_\alpha$ the set of quads which
represent $\alpha$, and, for each $\mathbf{d} \in \mathbb{N}^2$, we denote by $Q_{\mathbf{d}}$ the set of quads of bi-degree $\mathbf{d}$. Although
we use the same letter for both kinds of sets, the nature of the subscript should in practice
remove any ambiguity. As we will see, these families have similar properties. We start with
those of the first kind.

Proposition 4.7. Let $\alpha \in E^*$. The set $Q_\alpha$ of all quads representing $\alpha$ is an infinite set
whose elements have distinct size. If we order its elements by increasing size, then their
sizes form an increasing sequence of consecutive integers while their degrees, bi-degrees, and
second partial degrees form strictly increasing sequences in $\mathbb{N}$, $\mathbb{N}^2$ and $\mathbb{N}$ respectively. The
element of $Q_\alpha$ of smallest size is the quad of degree $d(\alpha)$ and bi-degree $\mathbf{d}(\alpha)$ associated to
the representation of $\alpha$ given by Proposition 4.1.

Proof. The relation $\gamma^{-i} = \gamma^{-i-1} + \gamma^{-i-2}$ shows that, for each $q = (i ; a, b, c) \in Q_\alpha$, the quad

\[ \theta(q) = \begin{cases} (i ; a-1, b+1, c+1) & \text{if } a \ge 2, \\ (i+1 ; b+1, c+1, 0) & \text{if } a = 1 \end{cases} \tag{28} \]

also represents $\alpha$. This defines an injective map $\theta$ from $Q_\alpha$ to itself, which increases the size
of a quad by 1. Since the size of any quad is finite and non-negative, this implies that any
$q \in Q_\alpha$ can be written in a unique way in the form $q = \theta^j(q_0)$ where $j \in \mathbb{N}$ and where $q_0$ is an
element of $Q_\alpha$ which does not belong to the image of $\theta$. The latter condition on $q_0$ means that
it is of the form $q_0 = (0 ; a, b, 0)$ with $a, b \ge 1$ or $q_0 = (i ; a, 0, c)$ with $a \ge 1$. According to
Proposition 4.1, there exists exactly one representation of $\alpha$ of that form and, by Proposition
4.3, it has degree $d(\alpha)$ and bi-degree $\mathbf{d}(\alpha)$. Thus $q_0$ is the element of $Q_\alpha$ of smallest size, and
we can organize $Q_\alpha$ in a sequence $(\theta^j(q_0))_{j \ge 0}$ where the size increases by steps of 1. Along this
sequence, the degree, bi-degree and second partial degree are strictly increasing, since for any
$q = (i ; a, b, c) \in Q_\alpha$, the formula (28) implies that $\mathbf{d}(\theta(q)) = \mathbf{d}(q) + (2f(i-1), 2f(i))$. □

The proof of the above proposition provides an explicit recursive way of constructing the
elements of $Q_\alpha$ by order of increasing size: given any $q \in Q_\alpha$, the next element is $\theta(q)$. We
will not use this explicit formula in the sequel, except in the proof of the second corollary
below.

Corollary 4.8. Let $d \in \mathbb{N}^*$, $\mathbf{d} \in \mathbb{N}^2 \setminus \{(0,0)\}$, and $\alpha \in E^*$. If $\alpha \in E_d$, then $s_d(\alpha)$ is the size
of the quad of largest degree $\le d$ which represents $\alpha$. If $\alpha \in E_{\mathbf{d}}$, then $s_{\mathbf{d}}(\alpha)$ is the size of the
quad of largest bi-degree $\le \mathbf{d}$ which represents $\alpha$.

Proof. Suppose that $\alpha \in E_d$. Then, by Proposition 4.6, the size $s_d(\alpha)$ of $\alpha$ relative to $d$ is
the largest size achieved by a quad of degree $\le d$ in $Q_\alpha$. By Proposition 4.7, this is also the
size of the quad of largest degree $\le d$ in $Q_\alpha$. The proof of the assertion in bi-degree $\mathbf{d}$ is
similar. □

Corollary 4.9. Let $d \in \mathbb{N}^*$ and $\alpha \in E_d \setminus \{0\}$. There exists one and only one representative
$(i ; a, b, c)$ of $\alpha$ which satisfies

\[ d - 2f(i+1) < a f(i) + b f(i+1) + c f(i+2) \le d. \tag{29} \]

For this choice of quad, one has $s_d(\alpha) = a + b + c$.

Proof. By Corollary 4.8 and the remark following Proposition 4.7, the integer $s_d(\alpha)$ is the size
of the unique quad $q$ in $Q_\alpha$ satisfying $d(q) \le d < d(\theta(q))$. Upon writing $q = (i ; a, b, c)$ and
using the formula (28) for $\theta(q)$, the latter inequality translates into (29), and the conclusion
follows. □
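The recursive construction of $Q_\alpha$ and the characterization of $s_d(\alpha)$ in Corollary 4.9 can be sketched as follows. This is an illustration under stated assumptions: the map `theta` is taken to replace one occurrence of $\gamma^{-i}$ by $\gamma^{-i-1} + \gamma^{-i-2}$, as the relation used in the proof of Proposition 4.7 suggests, the starting quad `q0` must be supplied by the caller, and all function names are hypothetical.

```python
def f(i):
    """Assumed recurrence: f(-2) = 1, f(-1) = 0, f(i) = f(i-1) + f(i-2)."""
    x, y = 1, 0
    for _ in range(i + 2):
        x, y = y, x + y
    return x

def degree(q):
    """Degree of the quad q = (i, a, b, c), written as a f(i) + b f(i+1) + c f(i+2)."""
    i, a, b, c = q
    return a * f(i) + b * f(i + 1) + c * f(i + 2)

def theta(q):
    """Assumed form of the map: trade one gamma^{-i} for gamma^{-i-1} + gamma^{-i-2}."""
    i, a, b, c = q
    if a >= 2:
        return (i, a - 1, b + 1, c + 1)
    return (i + 1, b + 1, c + 1, 0)

def largest_quad_of_degree_at_most(q0, d):
    """Follow theta from q0 until the next quad would have degree > d."""
    if degree(q0) > d:
        raise ValueError("q0 already has degree > d")
    q = q0
    while degree(theta(q)) <= d:
        q = theta(q)
    return q

def size_relative_to(q0, d):
    """Size a + b + c of the last quad of degree <= d in the chain starting at q0."""
    _, a, b, c = largest_quad_of_degree_at_most(q0, d)
    return a + b + c
```

Starting from `q0 = (0, 1, 0, 0)`, which represents $\alpha = 1$, the chain is $(0;1,0,0)$, $(1;1,1,0)$, $(2;2,1,0)$, $(2;1,2,1)$ with degrees $1, 3, 7, 13$, so the size of $1$ relative to $d = 7$ is $3$. The quad returned by the loop satisfies the double inequality of Corollary 4.9 by construction.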

Proposition 4.10. Let $\mathbf{d} = (d_1, d_2) \in \mathbb{N}^2 \setminus \{(0,0)\}$. The set $Q_{\mathbf{d}}$ of all quads of bi-degree
$\mathbf{d}$ is a finite non-empty set whose elements have distinct size. If we order its elements by
decreasing size, then their sizes form a decreasing sequence of consecutive integers while the
points of $E_{\mathbf{d}}$ that they represent form a strictly decreasing sequence of positive real numbers.
The element of $Q_{\mathbf{d}}$ of largest size has size $d_1 + d_2$ and represents the point $d_1 + d_2/\gamma$, while the
element of $Q_{\mathbf{d}}$ of smallest size represents the point $|d_1 - d_2/\gamma|$, both points being of bi-degree
$\mathbf{d}$. The elements of $Q_{\mathbf{d}}$ of intermediate sizes represent points of bi-degree $\le (d_1, d_2 - 1)$.

Proof. The set $Q_{\mathbf{d}}$ is not empty as it contains the quad

\[ q_0 = \begin{cases} (0 ; d_1, d_2, 0) & \text{if } d_1 > 0, \\ (1 ; d_2, 0, 0) & \text{if } d_1 = 0. \end{cases} \]

Since $f(-2) = 1$, $f(-1) = 0$ and $f(j) \ge 1$ for each $j \ge 0$, the formula (27) for the bi-degree
shows that $q_0$ is the only element of $Q_{\mathbf{d}}$ if $d_1 = 0$ or if $d_2 = 0$. As the proposition is easily
verified in that case, we may assume that $d_1$ and $d_2$ are positive.

Define $Q_{\mathbf{d}}^{+}$ to be the set of quads $q = (i ; a, b, c)$ of $Q_{\mathbf{d}}$ with $b \ge 1$. Then, under our present
assumptions, $Q_{\mathbf{d}}^{+}$ is not empty as it contains the point $q_0$. Moreover, the recurrence relation
for the function $f$ combined with (27) shows that one defines a map $\psi \colon Q_{\mathbf{d}}^{+} \to Q_{\mathbf{d}}$ by sending
a quad $q = (i ; a, b, c) \in Q_{\mathbf{d}}^{+}$ to

\[ \psi(q) = \begin{cases} (i ; a-1, b-1, c+1) & \text{if } a \ge 2, \\ (i+1 ; b-1, c+1, 0) & \text{if } a = 1 \text{ and } b \ge 2, \\ (i+2 ; c+1, 0, 0) & \text{if } a = b = 1. \end{cases} \tag{30} \]

This map is injective and decreases the size of a quad by 1. Thus any $q \in Q_{\mathbf{d}}$ can be written
in a unique way in the form $q = \psi^j(q')$ where $j \in \mathbb{N}$ and where $q'$ is an element of $Q_{\mathbf{d}}$ which
does not belong to the image of $\psi$. This means that $q'$ is of the form $(0 ; a, b, 0)$ with $a, b \ge 1$
or $(1 ; a, 0, 0)$ with $a \ge 1$, and thus that $q' = q_0$ since its bi-degree is $\mathbf{d}$.

The above discussion shows that we can organize $Q_{\mathbf{d}}$ in a sequence $(\psi^j(q_0))_{j=0}^{t}$ where the
size decreases by steps of 1, starting from the element $q_0$ of $Q_{\mathbf{d}}$ of largest size $d_1 + d_2$, and
ending with the element $q_t := \psi^t(q_0)$ of smallest size. Since the quad $q_t$ does not belong to
$Q_{\mathbf{d}}^{+}$, it has the form $(i ; a, 0, c)$ for some $i \ge 0$. In the notation of Proposition 4.1, it thus
represents a point of $E \setminus E^{+}$ which, by Proposition 4.3, has bi-degree exactly $\mathbf{d}$.
Since $d_1 + d_2/\gamma \in E^{+}$ and $|d_1 - d_2/\gamma| \in E \setminus E^{+}$ are the only points of $E$ of bi-degree $\mathbf{d}$,
we conclude that $q_0$ and $q_t$ are respectively the representations of $d_1 + d_2/\gamma$ and $|d_1 - d_2/\gamma|$
coming from Proposition 4.1. All intermediate quads $\psi^j(q_0)$ with $j = 1, \dots, t-1$ belong
to $Q_{\mathbf{d}}^{+} \setminus \{q_0\}$. They have the form $(i ; a, b, c)$ with $i = 0$ and $a, b, c \ge 1$, or with $i \ge 1$ and
$a, b \ge 1$. Therefore, by Proposition 4.3, they represent points of bi-degree $\le (d_1, d_2 - 1)$.

Finally, for any given $q = (i ; a, b, c) \in Q_{\mathbf{d}}^{+}$, the quad $\psi(q)$ given by (30) represents the point
$(a-1)\gamma^{-i} + (b-1)\gamma^{-i-1} + (c+1)\gamma^{-i-2}$ of $E^*$ which, as a real number, is smaller than the
point $a\gamma^{-i} + b\gamma^{-i-1} + c\gamma^{-i-2}$ represented by $q$. Thus the points of $E^*$ represented by quads
in $Q_{\mathbf{d}}$ decrease (in absolute value) with the size of these quads. □
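The proof describes the set of quads of a given bi-degree as a chain obtained by iterating one map from a starting quad. A minimal sketch of that enumeration follows; the names are hypothetical, quads are stored as tuples `(i, a, b, c)`, the map trades one $\gamma^{-i}$ and one $\gamma^{-i-1}$ for one $\gamma^{-i-2}$ in the three cases listed in the proof, and `f` is assumed to satisfy $f(-2) = 1$, $f(-1) = 0$, $f(i) = f(i-1) + f(i-2)$.

```python
GAMMA = (1 + 5 ** 0.5) / 2

def f(i):
    """Assumed recurrence: f(-2) = 1, f(-1) = 0, f(i) = f(i-1) + f(i-2)."""
    x, y = 1, 0
    for _ in range(i + 2):
        x, y = y, x + y
    return x

def bidegree(q):
    """Bi-degree (d_1(q), d_2(q)) of the quad q = (i, a, b, c)."""
    i, a, b, c = q
    return (a * f(i - 2) + b * f(i - 1) + c * f(i),
            a * f(i - 1) + b * f(i) + c * f(i + 1))

def value(q):
    """Real number a*gamma^-i + b*gamma^-(i+1) + c*gamma^-(i+2) represented by q."""
    i, a, b, c = q
    return a * GAMMA ** (-i) + b * GAMMA ** (-i - 1) + c * GAMMA ** (-i - 2)

def psi(q):
    """Trade one gamma^{-i} and one gamma^{-i-1} for one gamma^{-i-2}; needs b >= 1."""
    i, a, b, c = q
    if b < 1:
        raise ValueError("psi is only defined on quads with b >= 1")
    if a >= 2:
        return (i, a - 1, b - 1, c + 1)
    if b >= 2:
        return (i + 1, b - 1, c + 1, 0)
    return (i + 2, c + 1, 0, 0)

def quads_of_bidegree(d1, d2):
    """Quads of bi-degree (d1, d2), listed by decreasing size."""
    if d1 == 0 and d2 == 0:
        raise ValueError("the bi-degree (0, 0) is excluded")
    q = (0, d1, d2, 0) if d1 > 0 else (1, d2, 0, 0)
    chain = [q]
    while q[2] >= 1:  # psi applies as long as b >= 1
        q = psi(q)
        chain.append(q)
    return chain
```

For example, `quads_of_bidegree(2, 2)` returns `[(0, 2, 2, 0), (0, 1, 1, 1), (2, 2, 0, 0)]`, whose elements represent $2 + 2/\gamma$, $2$ and $2\gamma^{-2} = |2 - 2/\gamma|$ respectively.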

4.5. Proof of Theorem 3.1. We prove it in the following form.

Theorem 4.11. Let $\mathbf{d} = (d_1, d_2) \in \mathbb{N}^2$. For each integer $s \ge 0$, define

\[ E_{\mathbf{d}}(s) = \{ \alpha \in E_{\mathbf{d}} \,;\, s_{\mathbf{d}}(\alpha) = s \}. \]

Then, for $s > d_1 + d_2$, this set is empty while for $0 \le s \le d_1 + d_2$ its cardinality is

\[ |E_{\mathbf{d}}(s)| = 2\min\{d_1, d_2, s, d_1 + d_2 - s\} + 1. \tag{31} \]

Proof. We fix a choice of $d_1 \ge 0$ and prove the theorem by recurrence on $d_2 \ge 0$. For $d_2 = 0$,
we have $E_{d_1,0} = \{0, 1, \dots, d_1\}$ and $s_{d_1,0}(i) = i$ for $i = 0, 1, \dots, d_1$. Therefore $E_{d_1,0}(s)$ has
cardinality 1 for $0 \le s \le d_1$ and is empty for $s > d_1$, as asserted by the theorem. Suppose
now that $d_2 > 0$ and that the statement of the theorem holds in bi-degree $(d_1, d_2 - 1)$.

Fix an integer $s \ge 0$. In order to establish the formula in bi-degree $(d_1, d_2)$, we first
compare the sets $E_{d_1,d_2}(s)$ and $E_{d_1,d_2-1}(s)$.

1) According to Corollary 4.8, the points of $E_{d_1,d_2}(s)$ which do not belong to $E_{d_1,d_2-1}(s)$
are the elements $\alpha$ of $E$ for which the quad of $Q_\alpha$ of largest bi-degree $\le (d_1, d_2)$ has size $s$
but does not have bi-degree $\le (d_1, d_2 - 1)$. They are therefore the points of $E$ which are
represented by an element of $Q_{i,d_2}$ of size $s$ for some integer $i$ with $0 \le i \le d_1$. Since by
Proposition 4.7 the quads representing the same point have distinct second partial degrees,
and since by Proposition 4.10 each $Q_{i,d_2}$ contains at most one element of size $s$, we conclude
that the cardinality of $E_{d_1,d_2}(s) \setminus E_{d_1,d_2-1}(s)$ is the number of indices $i$ with $0 \le i \le d_1$ such
that $Q_{i,d_2}$ contains an element of size $s$.

2) According again to Corollary 4.8, the points of $E_{d_1,d_2-1}(s)$ which do not belong to
$E_{d_1,d_2}(s)$ are the points $\alpha$ of $E_{d_1,d_2-1}$ for which $Q_\alpha$ contains both a quad of size $s$ and
bi-degree $\le (d_1, d_2 - 1)$, and a quad of size $s + 1$ and bi-degree $(i, d_2)$ for some $i$ with $0 \le i \le d_1$.
The first condition however is redundant because if $\alpha \in E_{d_1,d_2-1}$ is represented by a quad
of size $s + 1$ and bi-degree $(i, d_2)$ with $0 \le i \le d_1$, then as the second partial degree
increases with the size in $Q_\alpha$ while the first partial degree does not decrease (by Proposition
4.7), the quad of $Q_\alpha$ with largest bi-degree $\le (d_1, d_2 - 1)$ must have size $s$. Thus, the set
$E_{d_1,d_2-1}(s) \setminus E_{d_1,d_2}(s)$ consists of the points of $E_{d_1,d_2-1}$ which are represented by an element
of $Q_{i,d_2}$ of size $s + 1$ for some $i$ with $0 \le i \le d_1$. Moreover, according to Proposition 4.10,
for a given $i \in \mathbb{N}$ all elements of $Q_{i,d_2}$ represent points of bi-degree $\le (i, d_2 - 1)$ except for
the ones of smallest or largest size. Therefore, the cardinality of $E_{d_1,d_2-1}(s) \setminus E_{d_1,d_2}(s)$ is the
number of indices $i$ with $0 \le i \le d_1$ such that $Q_{i,d_2}$ contains an element of size $s$ and an
element of size $s + 2$.

Combining the conclusions of 1) and 2), we obtain that the cardinality of $E_{d_1,d_2}(s)$ is equal
to that of $E_{d_1,d_2-1}(s)$ plus the number of indices $i$ with $0 \le i \le d_1$ such that $Q_{i,d_2}$ contains
an element of size $s$ but no element of size $s + 2$. Since, by Proposition 4.10, the largest size
of an element of $Q_{i,d_2}$ is $i + d_2$, the latter condition on $i$ amounts to either $i + d_2 = s$ or both
$i + d_2 = s + 1$ and $i \ne 0$ (so that $Q_{i,d_2}$ contains at least two elements and thus contains an
element of size $s$). This provides the recurrence relation

\[ |E_{d_1,d_2}(s)| = |E_{d_1,d_2-1}(s)| + \begin{cases} 0 & \text{if } s < d_2 \text{ or } s > d_1 + d_2, \\ 2 & \text{if } d_2 \le s < d_1 + d_2, \\ 1 & \text{if } s = d_1 + d_2. \end{cases} \]



Combining this with the induction hypothesis for $|E_{d_1,d_2-1}(s)|$, we get $|E_{d_1,d_2}(s)| = 0$ if
$s > d_1 + d_2$ and $|E_{d_1,d_2}(s)| = 1$ if $s = d_1 + d_2$. If $s < d_1 + d_2$, it also provides the required
value (31) for $|E_{d_1,d_2}(s)|$ because the difference

\[ \min\{d_1, d_2, s, d_1 + d_2 - s\} - \min\{d_1, d_2 - 1, s, d_1 + d_2 - 1 - s\} \]

is 0 if $\min\{d_1, s\} < \min\{d_2, d_1 + d_2 - s\}$ and 1 otherwise. Since $\min\{d_2, d_1 + d_2 - s\} =
d_2 - s + \min\{d_1, s\}$, this difference is therefore 0 if $s < d_2$ and is 1 if $d_2 \le s < d_1 + d_2$. □

4.6. Proof of Theorem 2.1. Similarly, we prove Theorem 2.1 in the following form.

Theorem 4.12. Let $d \in \mathbb{N}$. For each integer $s \ge 0$, define

\[ E_d(s) = \{ \alpha \in E_d \,;\, s_d(\alpha) = s \}. \]

Then, for $s > d$, this set is empty while for $0 \le s \le d$ its cardinality is

\[ |E_d(s)| = \begin{cases} 2s + 1 & \text{if } 0 \le s < d, \\ d + 1 & \text{if } s = d. \end{cases} \]

Proof. We proceed by recurrence on $d$. For $d = 0$, we have $E_0 = \{0\}$ and since $s_0(0) = 0$,
the theorem is verified in that case. Suppose now that $d > 0$ and that the conclusion of the
theorem holds in smaller degree.

Fix an integer $s \ge 0$. Arguing in a similar way as in the proof of Theorem 4.11, we find
that:

1) $E_d(s) \setminus E_{d-1}(s)$ consists of the points of $E$ which are represented by a quad of $Q_{i,d-i}$
of size $s$ for some integer $i$ with $0 \le i \le d$; its cardinality is the number of indices $i$
with $0 \le i \le d$ such that $Q_{i,d-i}$ contains an element of size $s$;

2) $E_{d-1}(s) \setminus E_d(s)$ consists of the points of $E_{d-1}$ which are represented by an element of
$Q_{i,d-i}$ of size $s + 1$ for some $i$ with $0 \le i \le d$; its cardinality is the number of indices
$i$ with $0 \le i \le d$ such that $Q_{i,d-i}$ contains an element of size $s$ and an element of size
$s + 2$.

Thus, the cardinality of $E_d(s)$ is equal to that of $E_{d-1}(s)$ plus the number of indices $i$ with
$0 \le i \le d$ such that $Q_{i,d-i}$ contains an element of size $s$ but no element of size $s + 2$. Since
the largest size of an element of $Q_{i,d-i}$ is $d$, the latter condition amounts to either $d = s$ or
both $d = s + 1$ and $0 < i < d$ (by the proof of Proposition 4.10, $Q_{i,d-i}$ reduces to a single quad
when $i = 0$ or $i = d$). This gives the recurrence relation

\[ |E_d(s)| = |E_{d-1}(s)| + \begin{cases} 0 & \text{if } s < d - 1 \text{ or } s > d, \\ d - 1 & \text{if } s = d - 1, \\ d + 1 & \text{if } s = d, \end{cases} \]

and from there the conclusion follows. □
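Both counting statements can be tested by brute force for small parameters. The sketch below is illustrative and rests on two assumptions: by Proposition 4.6 it is enough to scan quads, and the expected values in degree $d$ are taken to be $2s + 1$ for $0 \le s < d$ and $d + 1$ for $s = d$, which is what the recurrence yields and is consistent with $|E_d| = d^2 + d + 1$. All names are hypothetical.

```python
from collections import Counter

def f(i):
    """Assumed recurrence: f(-2) = 1, f(-1) = 0, f(i) = f(i-1) + f(i-2)."""
    x, y = 1, 0
    for _ in range(i + 2):
        x, y = y, x + y
    return x

def quads_of_degree_at_most(D):
    """All quads (i, a, b, c), a >= 1, with a f(i) + b f(i+1) + c f(i+2) <= D."""
    i = 0
    while f(i) <= D:
        for a in range(1, D // f(i) + 1):
            rest_a = D - a * f(i)
            for b in range(rest_a // f(i + 1) + 1):
                rest_b = rest_a - b * f(i + 1)
                for c in range(rest_b // f(i + 2) + 1):
                    yield (i, a, b, c)
        i += 1

def bidegree(q):
    i, a, b, c = q
    return (a * f(i - 2) + b * f(i - 1) + c * f(i),
            a * f(i - 1) + b * f(i) + c * f(i + 1))

def point(q):
    """Coordinates (m, n) of the point m + n/gamma represented by q."""
    i, a, b, c = q
    m = n = 0
    for j, mult in ((i, a), (i + 1, b), (i + 2, c)):
        m += mult * (-1) ** j * f(j - 2)
        n += mult * (-1) ** (j + 1) * f(j - 1)
    return m, n

def size_counts(admissible, D):
    """Number of points per largest size, over admissible quads of degree <= D."""
    best = {(0, 0): 0}  # the point 0 has the empty representation, of size 0
    for q in quads_of_degree_at_most(D):
        if admissible(q):
            p = point(q)
            best[p] = max(best.get(p, 0), sum(q[1:]))
    return Counter(best.values())

def counts_in_bidegree(d1, d2):
    ok = lambda q: bidegree(q)[0] <= d1 and bidegree(q)[1] <= d2
    return size_counts(ok, d1 + d2)

def counts_in_degree(d):
    return size_counts(lambda q: True, d)
```

For instance, `counts_in_bidegree(1, 1)` gives one point of size 0, three of size 1 and one of size 2, and `counts_in_degree(3)` gives the counts 1, 3, 5, 4 for the sizes 0, 1, 2, 3.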

5. Application to a dimension estimate 

The following result illustrates how the theory developed in Sections 2 and 4 can be used
to derive dimension estimates of the type that one requires in the construction of auxiliary
polynomials.

Theorem 5.1. Let d G N* and 5 G R with < 5 < ^d. Define Vd(S) to be the set of sequences 
21 G Q[£ (0) , £ (-1) ] <d satisfying < {X^] 5 . Then, V d (8) is a subspace of Q[£ (0) , X { -^]< d 
and its dimension satisfies 

\[ c_1 (d\delta)^{3/2} \le \dim_{\mathbb{Q}} V_d(\delta) \le 1 + c_2 (d\delta)^{3/2}, \tag{32} \]
for appropriate positive constants $c_1$ and $c_2$ depending only on $\xi$.

Proof. It is clear that $V_d(\delta)$ is a vector space over $\mathbb{Q}$. Combining Lemma 2.6 with Theorem
2.10 and then using Corollary 4.9, we find that its dimension is

\[ \dim_{\mathbb{Q}} V_d(\delta) = \sum_{\{\alpha \in E_d \,;\, \alpha \le \delta\}} (2 s_d(\alpha) + 1) = 1 + \sum_{(i ; a, b, c) \in S} (2(a + b + c) + 1), \]

where the rightmost sum runs over the set $S$ of all quads $(i ; a, b, c)$ satisfying the system of
inequalities

\[ d - 2f(i+1) < a f(i) + b f(i+1) + c f(i+2) \le d, \tag{33} \]
\[ a\gamma^{-i} + b\gamma^{-i-1} + c\gamma^{-i-2} \le \delta. \tag{34} \]

For each $i \in \mathbb{N}$, let $S_i$ denote the set of triples $(a, b, c) \in \mathbb{N}^3$ satisfying both $a \ge 1$ and the
first condition (33). In the computations below, we freely use the fact that $\gamma^{i-1} \le f(i) \le \gamma^{i}$
for each $i \in \mathbb{N}$ (as one easily shows by recurrence on $i$).

Let $i \in \mathbb{N}$. We first note that $S_i$ is empty if $f(i) > d$ (since we require $a \ge 1$). Assume
that $f(i) \le d$. Then each $(a, b, c) \in S_i$ satisfies $d/(2f(i+2)) \le a + b + c \le d/f(i)$ and so

\[ \frac{d}{\gamma^{i+2}} \le \frac{d}{f(i+2)} \le 2(a + b + c) + 1 \le \frac{2d}{f(i)} + 1 \le \frac{3d}{f(i)} \le \frac{3d}{\gamma^{i-1}}, \]
\[ \frac{d}{2\gamma^{2i+4}} \le \frac{d}{2f(i+2)}\,\gamma^{-i-2} \le a\gamma^{-i} + b\gamma^{-i-1} + c\gamma^{-i-2} \le \frac{d}{f(i)}\,\gamma^{-i} \le \frac{d}{\gamma^{2i-1}}. \]

The second chain of inequalities implies that if some $(a, b, c) \in S_i$ satisfies (34) then we must
have $d \le 2\gamma^{2i+4}\delta$. On the other hand, it also implies that any $(a, b, c) \in S_i$ satisfies (34) as
soon as $d \le \gamma^{2i-1}\delta$. Therefore, we obtain

\[ 1 + \sum_{i \in J} \frac{d}{\gamma^{i+2}}\,|S_i| \le \dim_{\mathbb{Q}} V_d(\delta) \le 1 + \sum_{i \in I} \frac{3d}{\gamma^{i-1}}\,|S_i|, \]

where $I$ denotes the set of integers $i \ge 0$ such that $f(i) \le d$ and $d \le 2\gamma^{2i+4}\delta$, and $J$ the set
of integers $i \ge 0$ such that $f(i) \le d$ and $d \le \gamma^{2i-1}\delta$.

Again, let $i \in \mathbb{N}$. For each choice of integers $a \ge 1$ and $c \ge 0$ with $a f(i) + c f(i+2) \le d$,
there are exactly two choices of integer $b \ge 0$ such that $(a, b, c)$ satisfies (33), or equivalently
such that $(a, b, c) \in S_i$. This means that the cardinality $|S_i|$ of $S_i$ is twice the number of
points $(a, c) \in \mathbb{N}^* \times \mathbb{N}$ satisfying $a f(i) + c f(i+2) \le d$. Since $f(i) \le d$ for each $i$ in $I$ or
$J$, this number is non-zero and a short computation provides absolute constants $c_3 > 0$ and
$c_4 > 0$ such that $|S_i| \le c_3 d^2 \gamma^{-2i}$ for each $i \in I$ and $|S_i| \ge c_4 d^2 \gamma^{-2i}$ for each $i \in J$.

If $I$ is not empty, it contains a smallest element $i_0$ and so the above considerations give

\[ \dim_{\mathbb{Q}} V_d(\delta) \le 1 + 3 c_3 d^3 \sum_{i \ge i_0} \gamma^{-3i+1} \le 1 + 4\gamma c_3 (d\gamma^{-i_0})^3. \]

Since $d \le 2\gamma^{2i_0+4}\delta$, we find that $d\gamma^{-i_0} \le (2\gamma^4 d\delta)^{1/2}$ and so $\dim_{\mathbb{Q}} V_d(\delta) \le 1 + c_2 (d\delta)^{3/2}$ with
$c_2 = 12\gamma^7 c_3$. If $I$ is empty, $V_d(\delta)$ has dimension 1 and this inequality still holds.

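Let $j_0$ denote the integer for which $\gamma^{2j_0-3}\delta < d \le \gamma^{2j_0-1}\delta$. Since $\delta \le \gamma d$, we have $j_0 \ge 0$,
and thus $j_0$ belongs to $J$ if and only if $f(j_0) \le d$. In that case, using $\gamma^{2j_0-3}\delta < d$, we find
that $d\gamma^{-j_0} > \gamma^{-3/2}(d\delta)^{1/2}$ and so

\[ \dim_{\mathbb{Q}} V_d(\delta) \ge |S_{j_0}|\,\frac{d}{\gamma^{j_0+2}} \ge \gamma^{-2} c_4 (d\gamma^{-j_0})^3 \ge \gamma^{-7} c_4 (d\delta)^{3/2}. \]

If $j_0 \notin J$, then we have $d < f(j_0) \le \gamma^{j_0}$, thus $0 < \gamma^{j_0}\delta < \gamma^{-j_0+3} d < \gamma^3$ and therefore

\[ \dim_{\mathbb{Q}} V_d(\delta) \ge 1 \ge \gamma^{-9/2} (d\delta)^{3/2}, \]

showing that the lower bound in (32) holds with $c_1 = \min\{\gamma^{-7} c_4, \gamma^{-9/2}\}$. □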
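To get a feeling for the size of this dimension, one can evaluate the sum of $2 s_d(\alpha) + 1$ over the points $\alpha$ of $E_d$ below the threshold, which is the expression that the proof starts from. The sketch below is only an illustration: the names are hypothetical, the sizes $s_d(\alpha)$ are found by scanning quads of degree at most $d$ (as Proposition 4.6 permits), the cutoff is taken to be $\alpha \le \delta$ (an assumption, since a strict inequality would differ only at the boundary), and the comparison with $\delta$ is done in floating point.

```python
GAMMA = (1 + 5 ** 0.5) / 2

def f(i):
    """Assumed recurrence: f(-2) = 1, f(-1) = 0, f(i) = f(i-1) + f(i-2)."""
    x, y = 1, 0
    for _ in range(i + 2):
        x, y = y, x + y
    return x

def quads_of_degree_at_most(D):
    """All quads (i, a, b, c), a >= 1, with a f(i) + b f(i+1) + c f(i+2) <= D."""
    i = 0
    while f(i) <= D:
        for a in range(1, D // f(i) + 1):
            rest_a = D - a * f(i)
            for b in range(rest_a // f(i + 1) + 1):
                rest_b = rest_a - b * f(i + 1)
                for c in range(rest_b // f(i + 2) + 1):
                    yield (i, a, b, c)
        i += 1

def point(q):
    """Coordinates (m, n) of the point m + n/gamma represented by q."""
    i, a, b, c = q
    m = n = 0
    for j, mult in ((i, a), (i + 1, b), (i + 2, c)):
        m += mult * (-1) ** j * f(j - 2)
        n += mult * (-1) ** (j + 1) * f(j - 1)
    return m, n

def sizes_relative_to(d):
    """Map each point (m, n) of E_d to its size relative to d."""
    best = {(0, 0): 0}
    for q in quads_of_degree_at_most(d):
        p = point(q)
        best[p] = max(best.get(p, 0), sum(q[1:]))
    return best

def dimension(d, delta, eps=1e-12):
    """Sum of 2*s_d(alpha) + 1 over the points alpha of E_d with alpha <= delta."""
    return sum(2 * s + 1
               for (m, n), s in sizes_relative_to(d).items()
               if m + n / GAMMA <= delta + eps)
```

For $d = 2$ the seven points of $E_2$ contribute $1 + 3 \cdot 3 + 3 \cdot 5 = 25$ when $\delta = 2\gamma$, while only $0$ and $\gamma^{-2}$ survive when $\delta = 0.5$, giving $1 + 3 = 4$.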

6. A COMPLEMENTARY RESULT 

Following a suggestion of Daniel Daigle, we show: 



Theorem 6.1. The ideal I defined in Lemma \2.J\ is a prime ideal of rank 3 of the ring 
$R = \mathbb{Q}[\mathbf{X}, \mathbf{X}^*]$.

This provides a proof of Corollary 2.13 which is independent of the combinatorial arguments
of Section 4. Indeed, it follows easily from the considerations of § §2.4H2"31 that $X_0^{(0)}$,
$X_0^{(-1)}$ and X-i are elements of $\mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]$ which are algebraically independent over $\mathbb{Q}$.

On the other hand, the evaluation map $\pi \colon R \to \mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]$ defined by (13) induces a
surjective ring homomorphism $\bar{\pi} \colon R/I \to \mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]$ and, if we take for granted Theorem
6.1, the quotient $R/I$ is an integral domain of transcendence degree 3 over $\mathbb{Q}$. Therefore, as
Daniel Daigle remarked, this means that $\bar{\pi}$ is an isomorphism. This not only proves Corollary
2.13 but also:

Corollary 6.2. The ring $\mathbb{Q}[\mathfrak{X}^{(0)}, \mathfrak{X}^{(-1)}]$ is an integral domain of transcendence degree 3
over $\mathbb{Q}$.



In order to prove Theorem 6.1, we first note that we may assume, without loss of generality,
that the coefficient $a_{2,2}$ of the matrix $M$ is non-zero. Indeed, as we saw at the end of §2.2,
at least one of the coefficients $a_{1,1}$ or $a_{2,2}$ of $M$ is non-zero. If $a_{2,2} = 0$, then $a_{1,1} \ne 0$ and
we replace $I$ by its image under the ring automorphism of $R$ which sends $X_i$ to $X_{2-i}$ and
$X_i^*$ to $X_{2-i}^*$ for $i = 0, 1, 2$. This automorphism fixes the first two generators $\det(\mathbf{X}) - 1$ and
$\det(\mathbf{X}^*) - 1$ of $I$ and maps the third generator to a polynomial of the same form with the coefficient
$a_{2,2}$ replaced by $-a_{1,1} \ne 0$.

Now, let $V$, $V^*$ and $W$ be indeterminates over $R$. We put an $\mathbb{N}^3$-grading on the ring
$R_3 := R[V, V^*, W]$ by requesting that each variable is multi-homogeneous with multi-degree:

\[ \deg(X_i) = (1, 0, i), \quad \deg(X_i^*) = (0, 1, i) \quad \text{for } i = 0, 1, 2, \]
\[ \deg(V) = (1, 0, 1), \quad \deg(V^*) = (0, 1, 1) \quad\text{and}\quad \deg(W) = (0, 0, 1). \]

A polynomial $P$ in $R_3$ is thus multi-homogeneous of multi-degree $(d_1, d_2, d_3)$ if and only if,
in the usual sense, it is homogeneous of degree $d_1$ in $(\mathbf{X}, V)$, homogeneous of degree $d_2$ in
$(\mathbf{X}^*, V^*)$, and if its image under the specialization $X_i \mapsto W^i X_i$, $X_i^* \mapsto W^i X_i^*$ ($i = 0, 1, 2$),
$V \mapsto WV$ and $V^* \mapsto WV^*$ belongs to $W^{d_3} R[V, V^*]$. In this case, $d_3$ is called the weight of $P$.
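Let $I_3$ denote the ideal of $R_3$ generated by

\[ F = \det(\mathbf{X}) - V^2, \qquad F^* = \det(\mathbf{X}^*) - (V^*)^2, \]
\[ G = a_{1,1} \begin{vmatrix} X_0^* & X_1^* \\ X_0 & X_1 \end{vmatrix} W^2 + a_{1,2} \begin{vmatrix} X_1^* & X_2^* \\ X_0 & X_1 \end{vmatrix} W + a_{2,1} \begin{vmatrix} X_0^* & X_1^* \\ X_1 & X_2 \end{vmatrix} W + a_{2,2} \begin{vmatrix} X_1^* & X_2^* \\ X_1 & X_2 \end{vmatrix}. \]
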
Since $F$, $F^*$ and $G$ are respectively multi-homogeneous of multi-degree $(2, 0, 2)$, $(0, 2, 2)$ and
$(1, 1, 3)$, the ideal $I_3$ is multi-homogeneous. By construction, it is mapped to $I$ under the
$R$-linear ring homomorphism from $R_3$ to $R$ sending $V$, $V^*$ and $W$ to 1. Moreover, any element
of $I$ is the image of a multi-homogeneous element of $I_3$ under that map. Therefore, in order
to prove Theorem 6.1, it suffices to show that $I_3$ is a prime ideal of $R_3$. In preparation for
this, we first establish the following lemma where, for any $f \in R_3$, we define $(I_3 : f) = \{ a \in
R_3 \,;\, af \in I_3 \}$.

Lemma 6.3. The polynomials $F$, $F^*$ and $G$ form a regular sequence in $R_3$ and we have
$(I_3 : X_0) = (I_3 : X_0^*) = I_3$.

Proof. Each polynomial in the sequence $X_0$, $X_0^*$, $F$, $F^*$, $G$ depends on a variable on which
the preceding polynomials do not depend ($V$ for $F$, $V^*$ for $F^*$ and $W$ for $G$). Therefore,
this sequence generates an ideal of $R_3$ of rank 5, and so it is a regular sequence. Since these
polynomials are homogeneous, any reordering of this sequence remains a regular sequence,
and the conclusion follows. □

We now complete the proof of Theorem 6.1 by showing:

Lemma 6.4. The ideal $I_3$ is prime of rank 3.



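Proof. Let $S$ denote the multiplicative subset of $R_3$ generated by $X_0$ and $X_0^*$. By Lemma
6.3, we have $(I_3 : f) = I_3$ for each $f \in S$. So, it is equivalent to prove that $S^{-1} I_3$ is a prime
ideal in the localized ring $S^{-1} R_3$. Since

\[ X_0^{-1} F = X_2 - X_0^{-1}(X_1^2 + V^2) \quad\text{and}\quad (X_0^*)^{-1} F^* = X_2^* - (X_0^*)^{-1}\big((X_1^*)^2 + (V^*)^2\big), \]

this amounts simply to showing that $G$ is mapped to a prime element under the ring
endomorphism of $S^{-1} R_3$ sending $X_2$ to $X_0^{-1}(X_1^2 + V^2)$, $X_2^*$ to $(X_0^*)^{-1}\big((X_1^*)^2 + (V^*)^2\big)$, and all
other variables to themselves. The image of $G$ takes the form $(X_0 X_0^*)^{-1} H$ where

\[ \begin{aligned} H ={}& a_{1,1} \begin{vmatrix} X_0^* & X_1^* \\ X_0 & X_1 \end{vmatrix} X_0 X_0^* W^2 + a_{1,2} \begin{vmatrix} X_0^* X_1^* & (X_1^*)^2 + (V^*)^2 \\ X_0 & X_1 \end{vmatrix} X_0 W \\ &+ a_{2,1} \begin{vmatrix} X_0^* & X_1^* \\ X_0 X_1 & X_1^2 + V^2 \end{vmatrix} X_0^* W + a_{2,2} \begin{vmatrix} X_0^* X_1^* & (X_1^*)^2 + (V^*)^2 \\ X_0 X_1 & X_1^2 + V^2 \end{vmatrix}. \end{aligned} \]
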



Since $R_3$ is a unique factorization domain, we are reduced to showing that $H$ is an irreducible
element of $R_3$. Moreover, since $H$ is multi-homogeneous of multi-degree $(2, 2, 3)$, it suffices
to prove that $H$ has no non-constant multi-homogeneous divisor of multi-degree $< (2, 2, 3)$.
Let $H_0$ denote the constant coefficient of $H$, viewed as a polynomial in $W$. Since $a_{2,2} \ne 0$, it
is non-zero. It takes the form $H_0 = a(V^*)^2 - b$ where $a$ and $b$ are relatively prime elements
of $R[V]$ such that $ab$ is not a square in $R[V]$ ($ab$ is divisible by $X_0$ but not by $X_0^2$). Therefore
$H_0$ is irreducible. If $A$ is a multi-homogeneous divisor of $H$ of multi-degree $< (2, 2, 3)$, then
the constant coefficient $A_0$ of $A$ (as a polynomial in $W$) is a divisor of $H_0$. Since $H_0$ is
irreducible and multi-homogeneous of the same multi-degree $(2, 2, 3)$ as $H$, it follows that
$A_0$ is a constant and therefore that $A$ itself is a constant. □

Acknowledgments. The authors thank Daniel Daigle for several interesting discussions on
the topic of this paper, for the suggestion mentioned above, as well as for suggesting the
formalism of the ring of sequences in Section 2.